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SUMMARY  OF  REPORT 


The  report  consists  of  papers  written  in  six  research  areas. 
Most  of  these  papers  have  been  published  in  journals  or  conference 
proceedings,  or  are  soon  to  be  published.  A  few  are  taken  from 
earlier  semi-annual  reports. 

The  six  research  areas  are  Packet  Radio  Networks,  Adaptive 
Routing,  Algorithms,  Network  Design,  Switching  Techniques,  and 
Local  Area  Networks. 

In  the  area  of  Packet  Radio  Networks,  our  results  on  the 
throughput  analysis  of  Multihop  Packet  Radio  Networks  was  first 
presented  at  ICC'80  (A.l).  An  expanded  version  with  extensions  has 
been  accepted  for  publication  in  the  IEEE  Transactions  on  Communica¬ 
tions  (A. 2).  Further  extensions  are  presented  in  papers  A. 3,  A  .4, 
and  A. 5.  In  this  work  we  developed  an  algorithm  for  finding  the 
throughput  of  Multihop  Packet  Radio  Networks  with  arbitrary 
topologies,  sizes,  connectivities,  traffic  requirements,  and  routings. 
Details  of  the  algorithm  are  given  in  paper  A. 6. 

Our  work  on  Adaptive  Routing  was  first  presented  at  NTC'76. 
It  was  further  reported  on  in  the  IEEE  Transaction  on  Communications 
in  an  invited  paper  in  April  1981  (B.l).  Extensions  to  this  work  are 
presented  in  papers  B.2,  B.3,  B.6,  and  B7.  Our  approach  involved 
two  levels  for  adaptive  routing  -  a  slowly  varying  global  level  and  a 
dynamic  local  level.  This  partition  produced  a  stable  routing  that 
was  amenable  to  analysis.  We  showed  that  significant  performance 
improvements  were  obtainable.  A  control  theoretic  approach  applied 
to  traffic  networks  is  presented  in  papers  B.4  and  B.5. 


Our  work  in  the  area  of  algorithms  has  focused  on  finding  ap¬ 
proaches  to  solving  intractable  problems  which  arise  in  the  design  and 
analysis  of  networks.  Several  of  these  papers  (C.l,  C.2,  and  C.4) 
deal  with  the  solution  of  specific  fundamental  problems.  Others  (C.5, 
C.6)  deal  with  the  application  of  probabilistic  methods  to  the  solution 
of  a  broad  class  of  problems.  Finally,  in  C.3,  we  present  a  method 
of  directly  trading  computational  complexity  for  solution  accuracy  in 
the  solution  of  combinatorial  optimization  problems. 

Our  work  in  the  area  of  Network  Design  has  led  to  solutions  to 
specific  problems  in  several  areas  including  centralized  networks 
(D.l,  D.2),  and  circuit  switched  networks  (D.4,  D.5,  and  D.6).  It 
has  also  resulted  in  the  development  of  a  facility  location  algorithm 
(D.3)  which  has  been  applied  to  the  location  of  earth  stations  and 
packet  radio  repeaters  in  broadcast  networks. 

In  section  E,  we  present  our  work  on  an  Information  Theoretic 
approach  to  communications  in  networks  (E.l  and  E.2),  on  a  compari¬ 
son  of  packet  switching  and  character  switching,  including  an  optimum 
packetization  strategy  (E.3),  on  optimal  multiplexing  in  Integrated 
Switching  schemes  (E.4)  and  on  Integrated  Satellite  Access  (E.5). 

In  papers  F.l  and  F.2,  we  present  some  work  on  Local  Area 
Networks  for  which  we  evaluate  performance  and  assess  the  impact  of 
user  interface  architecture. 

Most  of  the  work  reported  upon  here  was  supported  by  the  US 
ARMY  (CECOM).  Portions  of  this  work  were  also  supported  by  the 
National  Science  Foundation,  by  Network  Analysis  Corporation,  by 
General  Telephone  and  Electronics,  and  by  IBM's  Federal  Systems 
Division.  Research  not  supported  under  this  grant  is  indicated  by  an 


asterisk  in  the  table  of  contents  and  is  included  to  complement  the 
other  research  reports  and  provide  a  more  complete  record  of  our 
activities. 
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Packet  Radio  Networks 


THROUGHPUT  ANALYSIS  OF  MULTIHOP  PACKET  RADIO  NETWORKS * 


Robert  R.  Boorstyn  and 
Aaron  Kershenbaum 

Polytechnic  Institute  of 
New  York 
333  Jay  Street, 

Brooklyn,  New  York  11201 


V eli  Sahin 

Bell  Laboratories 
Holmdel,  New  Jersey  07733 


ABSTRACT 

We  consider  the  problem  of  obtaining  exact  expressions  for  throughput  and  blocking 
probabilities  in  multihop  packet  ratio  networks  operating  under  CSMA.  We  obtain  exact  results 
for  a  general  class  of  message  lengths,  for  general  topologies,  and  for  perfect  capture.  These 
results  are  obtained  by  assuming  perfect  acknowledgments. 


I.  INTRODUCTION 

We  consider  the  problem  of  obtaining  exact  expressions  for  throughput  and  blocking 
probabilities  in  multihop  packet  radio  networks  operating  under  carrier  sense  multiple  access 
(CSMA).  Procedures  are  developed  which  can  be  used  to  analyze  general  topologies  for  a 
general  class  of  packet  length  distribution.  Examples  of  chains,  rings,  and  stars  are  presented. 

II.  THE  NETWORK  MODEL 

We  consider  the  problem  of  analyzing  the  throughput  capability  of  a  multihop  packet 
radio  network  operating  under  carrier  sense  multiple  access  (CSMA).  Thus,  we  assume  that 
the  network  is  comprised  of  terminals  equipped  with  radio  transponders  suitable  for 
broadcasting  data  over  a  limited  distance.  In  general,  the  source  and  destination  terminals 

•  This  research  was  partially  supported  by  USARMY  CENCOMS  under  contract  DAAK  80-80-K-0579,  and  by  the 
National  Science  Foundation  under  gram  ENG-79-08210 


cannot  bear  each  other  directly,  and  the  data  has  to  be  relayed  by  one  or  more  intermediate 
devices.  A  separate  set  of  devices,  called  repeaters,  may  exist  for  this  purpose,  or  the  terminals 
themselves  may  relay  messages  for  one  another. 

Control  of  the  network  is  completely  distributed,  i.e.,  no  station  or  central  control 
mechanism  is  assumed  to  exist.  Rather,  we  assume  that  each  source  terminal  has  prestored 
one  or  more  routes  to  all  destinations  and  includes  all  necessary  routing  information  in  the 
packets  if  transmits.  These  assumptions  are  made  merely  to  simplify  the  presentation.  In  fact, 
the  results  presented  are  valid  for  networks  using  alternate  routing  as  long  as  routing  changes 
are  not  made  over  short  time  intervals.  One  of  the  motivations  for  this  study  came  from  a 
consideration  of  the  design  of  routing  procedures  for  such  networks.  It  was  necessary, 
however,  to  first  develop  an  understanding  for  the  throughput  of  various  topologies. 

Exogenous  traffic  is  modeled  as  independent  Poisson  processes  arriving  at  each 
source  node,  with  appropriate  rates  and  packet  lengths.  The  topology  is  specified  by  a  listing  of 
which  terminals  (or  repeaters)  can  hear  each  other.  In  the  remainder  we  will  not  distinguish 
between  terminals  and  repeaters  and  will  refer  to  them  collectively  as  cither  terminals  or  nodes. 
In  general  the  transmissions  of  one  terminal  can  be  heard  by  many  other  terminals.  The 
routing  will  specify  which  terminal  is  to  repeat  the  packet,  if  necessary. 

If  two  or  more  transmissions  are  simultaneously  heard  by  a  terminal  (called  a 
’collision’)  at  least  one,  and  possibly  both,  is  ’lost'  and  must  be  retransmitted.  We  assume 
retransmissions  are  scheduled  at  a  random  instant  in  time  sufficiently  far  in  the  future  so  as  to 
preserve  the  Poisson  nature  of  the  combined  traffic  stream,  which  now  consists  of  exogenous 
traffic  and  rescheduled  traffic.  For  this  study,  we  assume  that  a  packet  can  be  retransmitted  as 
many  times  as  is  necessary,  i.e.,  that  there  is  no  maximum  allowable  number  of 
retransmissions. 

At  any  time,  terminals  may  either  transmit  or  receive,  they  cannot  do  both 
simultaneously.  Before  transmitting,  a  terminal  senses  the  channel.  If  it  detects  that  any  of  its 


neighbors  (i.e.,  terminals  that  it  can  hear)  are  transmitting  (by,  e.g.,  sensing  a  carrier)  it 
reschedules  the  transmission  as  for  collided  packets  above.  If  at  the  scheduled  time  for  a 
transmission,  the  terminal  is  already  engaged  in  transmitting  a  packet,  the  new  packet  is  also 
rescheduled  as  above.  Thus  packets  are  continually  rescheduled  until  they  are  successfully 
delivered  to  the  next  terminal  on  their  route.  We  assume  that  the  total  stream  of  traffic 
scheduled  by  any  terminal  is  a  Poisson  process.  This  includes  originating  traffic  and  packets 
rescheduled  either  due  to  collisions  or  due  to  the  channel  having  been  sensed  busy.  This 
scheme  is  called  carrier  sense  multiple  access  (CSMA).  The  Poisson  assumption  is  valid  for  the 
assumptions  made  above  and  will  yield  accurate  results  for  throughput.  Compromises  will  have 
to  be  made,  however,  if  an  accurate  picture  of  time  delays  is  to  be  considered. 

It  is  possible,  due  to  non-zero  propagation  delay,  that  collisions  of  transmissions 
from  neighboring  terminals  may  still  take  place  despite  the  CSMA  strategy.  This  will  occur  if  a 
terminal  senses  the  channel  before  another  terminal’s  transmission  is  received.  This  effect  is 
small  if  terminals  are  reasonably  close  or  are  not  transmitting  at  high  speed.  We  will  ignore  this 
phenomena  here,  and  assume  that  all  transmissions  are  instantly  heard  by  their  neighbors. 

A  passive  acknowledgment  is  used  for  transmission  to  neighbors.  The  transmitting 
terminal  listens  to  the  channel  to  hear  if  a  packet  is  being  rebroadcast  by  a  neighbor.  If  after  a 
prespecified  time  interval,  the  transmitting  node  does  not  hear  the  packet  rebroadcast,  it 
retransmits  the  packet.  But  the  packet  may  have  been  successfully  received  by  the  neighbor 
even  though  the  originator  does  not  hear  the  rebroadcast.  Duplicate  packets  may  be 
transmitted  and  deleted  only  at  the  final  destination  or  they  may  be  detected  and  deleted  earlier. 
An  end-to-end  acknowledgment  is  returned  to  the  originator  from  the  final  destination.  In  this 
paper  we  assume  that  passive  acknowledgments  are  always  heard  and  ignore  the  effect  of  end- 
to-end  acknowledgments.  Alternately,  these  acknowledgements  could  have  been  added  to  the 
required  traffic.  (In  a  sequel  paper,  the  effect  of  passive  acknowledgements  will  be  studied). 


We  depict  the  topology  of  the  network  by  a  graph  where  terminals  are  represented 
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by  nodes.  The  nodes  are  connected  by  a  link  if  they  can  hear  each  other's  transmissions,  i.e.,  if 
they  are  neighbors.  As  an  example  see  Figure  1.  Node  A  can  hear  node  B,  but  not  node  C. 
Node  B  can  hear  both  nodes  A  and  C.  Node  C  can  hear  node  B,  but  not  node  A.  If  node  A  is 
transmitting  to  node  B  and  node  C  begins  transmitting,  then  the  transmission  from  A  to  B  may 
be  lost  depending  upon  the  ’capture’  assumptions  we  make.  A  conservative  assumption  is  that 
the  A  to  B  transmission  is  lost  -  this  is  known  as  zero  capture.  Alternatively,  perfect  capture 
assumes  that  this  transmission  is  successfully  received.  Half-amplitude  capture  assumes  that 
the  transmission  is  lost  if  C  dominates  A  at  B.  This  can  happen  if  C  is  closer  to  B  than  A  is  to 
B,  or  has  a  greater  signal  strength  perceived  at  B  than  A  has.  If  A  dominates  C,  then  the 
transmission  is  successful.  However,  in  all  cases  of  a  collision  we  assume  the  later  transmission 
is  lost.  Thus  if  C  is  transmitting  to  B,  this  packet  is  lost  in  all  cases.  We  will  consider  only 
perfect  capture  situations  below.  Note  that  under  CSMA  if  node  B  is  transmitting,  neither  A  or 
C  is  allowed  to  transmit. 

We  assume  that  a  routing  has  been  specified.  This  takes  the  form  of  deciding  which 
of  the  neighbors  are  to  rebroadcast  a  packet  from  a  particular  source  to  a  particular  destination. 
Thus  the  amount  of  traffic  that  a  terminal  wishes  to  send  to  its  neighbor  can  be  computed.  If 
these  rebroadcast  packets  are  scheduled  at  a  random  time  far  in  the  future  the  Poisson 
assumption  for  traffic  streams  is  preserved.  We  assume  that  the  traffic  between  neighbors  is 
specified  and  form  independent  Poisson  processes.  We  assume  that  the  packet  length  is 
reassigned  independently  at  each  hop.  This  is  analogous  to  the  ’independence  assumption*  in 
queuing  networks. 

The  details  of  CSMA  for  a  single  hop  network  can  be  found  in  the  papers  by  Tobagi 
and  Kleinrock^*\  Tobagi  has  also  developed  some  simple  models  for  two-hop  networks^. 
Details  of  a  packet  radio  network  can  be  found  in  a  paper  by  Kahn^\  A  discussion  of  routing 
in  multibop  packet  radio  can  be  found  in  the  paper  by  Gitman,  Van  Slyke,  and  Frank'  \  An 
earlier  version  of  this  paper  was  presented  at  ICC’80(5).  More  details  can  be  found  i  the 
thesis  of  Sahin(6). 


HI  (,  t  N  E  R  A  L  REM  L.  rs 

In  this  section  we  develop  some  expressions  th.it  are  valui  t  the  ;v..  1 

network  we  have  modeled  above  using  OSM.A  and  with  an  arbitrary  packet  lengtr  o.,-:  . 

Let  i  be  a  node,  n,  one  of  its  neighbors,  .V,’  the  set  of  all  the  neighbors  of  i.  and  N  the  >et 
all  i's  neighbors,  including  i  Let  g,  be  the  total  rate  (in  packets  seel  of  all  schedule!  'tv’ 
node  i.  This  includes  originating  traffic  and  all  rescheduled  traffic  and  is  assumed  to  be  P  mss,  r 
Let  1/m,  be  the  average  length  of  packets  transmitted  by  node  i.  Let  G  =■  g,  ..  3e  . 
normalized  rate. 

Node  i  is  either  busy  (transmitting)  or  idle.  It  will  transmit  a  scheduled  pack?: 
the  instant  it  is  scheduled  all  nodes  in  .V,  are  idle  Let  A  be  a  set  of  nodes  Let  P  A 
probability  that  at  a  random  instant  all  nodes  in  A  are  idle  The  nodes  not  m  A  •  j 
not  be  idle.  Similarly  P(t),  P(i,A),  P(A,B)  are  the  probabilities  that  i  is  idle  n.-or 
in  A  are  idle,  and  all  nodes  in  A  and  B  are  idle. 

Since  traffic  is  scheduled  at  node  i  with  a  Poisson  rate  g, .  will  be  trarsm  1 
iV,  is  idle,  and  transmissions  have  average  length  1/m,  .  the  probability  that  i  is  bus;  :s  g  •  r 

1  ~P(i)  =  G,P(A’,) 

If  i  is  busy  then  under  CSMA,  n,  must  be  idle.  Then  since 

P  ( n, )  =  P{n,,i)  +  P  ( n, !/  busy  )  ( 1  —  P  ( i )  ] 


and  Pi  n,  \i  busy )  =  1 .  we  have 


Pin,  ,i  )  =  Pin,  )  a-  Pi,)  -  1 


(  - 1 


•-  -v, 


J 

I 

, 

J 

« 


« 


Similarlv ,  if  .4 


1 


i  1  •  Pu I  -  1 


'etting  A  —  .V,  in  equation  ■  m.  v.va 


P  N  ')  -  /*  <  V  )//»(/ 1,1 V/) 


we  get 


P(i\ -V,  )  = 


Equation  (4)  is  often  found  in  CSMA  literature. 


1 +-G, 


(4) 


A  packet  from  i  to  n,  will  be  transmitted  when  it  is  scheduled  if  A,  is  idle.  During 
the  transmission  all  nodes  in  A'*  will  be  idle.  It  will  be  successfully  received  at  n,  if  all 
neighbors  of  n,  not  in  A-,  are  also  idle  at  the  beginning  of  transmission.  Otherwise  a  collision 
will  occur.  Let  jiiJt  be  the  rale  in  packets/sec.  determined  by  the  routing  and  assumed  Poisson, 
of  the  traffic  that  i  wishes  to  send  to  n,.  This  is  the  required  throughput  or  offered  traffic.  Let 
g  „  be  the  rate  of  all  scheduled  traffic  from  i  to  n, .  We  have  also  assumed  that  all  these 
streams  are  Poisson  and  independent  Of  these  g,_„  packets  per  second,  s,  „  must  be 
successful.  Thus 


T  .n  A; 

S,  .n  . 


•-  =  ) 


(5) 


where  S  n  =  and  CJ, 


g.  -  u 


•  i 


'» • 


%  * 
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The  total  scheduled  traffic  to¬ 


rn  equations  ill  through  I'm  wv  a- 


red )  at  a  node  is  given  by 


1'  (l‘ 
n  •  A 


i  n  i 


a  relation  between  the  S,  „  and  0, .  and 
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% 


D 


*  • 


determine  the  maximum  S,  „  the  network  can  support  This  we  call  the  <  maximum  i 
throughput  or  capacity  In  the  next  sec'ion  we  develop  this  relationship  for  exponential  packe: 
lengths  and  arbitrary  topologies.  Later  we  extend  the  analysis  to  a  general  class  of  packet 
length  distributions. 

IV.  EXPONENTIALLY  DISTRIBUTED  PACKET  LENGTHS 

If  the  packet  lengths  are  exponentially  distributed,  then  the  system  can  be  viewed  as 
a  Markov  process  where  the  states  are  identified  by  which  nodes  are  idle  and  which  are  busy 
Let  D  be  a  set  of  busy  nodes.  Because  of  CSMA.  no  nodes  in  D  may  be  neighbors  of  each 
other.  Let  Q(D)  be  the  probability  that  at  an  instant  of  time,  all  nodes  in  D  are  busy,  and  all 
nodes  not  in  D,  are  idle.  Then  each  set,  D,  represents  a  state  in  a  Markov  system,  and  Q(D)  is 
the  state  probability.  In  particular,  the  null  set  D  =  <t>,  represents  the  state  that  all  nodes  are 
idle. 

Assume  the  system  is  in  state  D.  It  will  leave  the  state  if  any  itD  stops 
transmitting.  This  happens  with  rate  Thus  the  transition  to  state  {D-i|  occurs  with  rate  n,. 
The  only  other  way  to  leave  state  D  is  for  one  of  the  idle  nodes  that  is  not  a  neighbor  of  any 
itD  to  begin  to  transmit.  This  occurs  with  rate  gr  Let  ,\'D  be  the  set  of  all  neighbors  of  all 
nodes  in  D.  Then  the  transition  from  D  to  iD+j(,  jt N0,  occurs  with  rate  g..  The  global 
balance  equations  for  this  system  are 

(2  +  2  gj)QW)  -  2  *  Q(D-n  +  2  njQ(D+j)  <7) 

,,D  J  £  ,<£>  j  £  cVD 

where  D  is  one  of  the  special  sets  defined  above. 

It  is  easy  to  see  that  these  equations  are  satisfied  by 

Q  (D  )  =  —  Q  ( D  -i )  =  G,  Q  ( D  -i ) .  i  <  D 

Mi 


(8) 
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Thus 


Q(D)  -  (  n  G,)  Q(<t>)  (9) 

f  tD 

where  we  adopt  the  convention  that  n  G,  —  1  Summing  over  all  D,  we  get 

2  2(0)  -  2  in  Q(*)]  -  i  no) 

all  D  all  D  "D 

In  the  previous  section  we  found  we  were  interested  in  quantities  like  P(A),  where 
A  is  any  set  of  nodes,  and  P(A)  is  the  probability  that  all  nodes  in  A  are  idle,  and  all  nodes  not 
in  A  may  or  may  not  be  idle.  This  can  be  found  by  summing  Q(D)  over  all  sets  D  that  do  not 
contain  nodes  in  A.  Thus 


P(A)~  2  2(0) 

DCAc 


2  me,) 

DCA‘  i,D 


2 

D  C.V 


(n 

I.z> 


Gi) 


(ID 


where  DCAc  refers  to  all  such  sets  contained  in  the  complement  of  A  and  N  is  the  set  of  all 
nodes.  We  adopt  the  shorthand  notation. 


sp(B)  =  2  ( n  g,) 

DCB  ,,D 


(12) 


where  SP  refers  to  sum  of  products.  Thus 

P(A)  =  SP(A')/SP{X)  (13) 

Equations  (5),  (6).  and  (11)  can  be  used  for  any  topology  to  generate  the  solution 
to  our  problem.  The  equations  relating  the  S,  „  ,  G,  „  ,  and  (7;  can  be  solved  iteratively  For 

example,  equation  (5)  now  becomes 
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SP(.X) 


where  by  A  +  B  we  mean  the  union  of  A  and  B. 

Evaluation  of  sums  of  products  in  equation  (12)  are  made  easier  by  the  following 
two  rules.  Consider  two  sets  of  nodes  A  and  B  such  that  no  node  in  A  can  hear  anv  node  of  B. 


SP(A+B)  =  SP(A)  SP(B),  A  (~)  B  *  <*> 


SP{A)  «■  SP(A—i)  +  G.SPiA-ty),  itA 


To  prove  these  rules  just  consider  all  products.  We  have  successfully  evaluated  many  complex 
topologies  with  these  procedures. 

There  are  other  relations  which  will  be  found  useful  in  extending  our  model  to  more 
complex  situations.  We  prove  some  of  these  below.  Let  C  be  a  cut,  i.e. ,  a  set  of  nodes  that 
divides  the  network  into  three  parts  A,  B,  and  C,  where  A  and  B  have  no  neighbors  in  common 
as  in  equation  (15).  Let  A  =*A  ,  +  2,  B  =  B  ^  +  B2  where  A  tA2  =  B  tB2  —  0.  Then 


P(A,\C,B,) 


P(A„C,Bt)  SP(A2+B2)  SP(A2) 


P(C,B,) 


SP(A+B2)  SP{A ) 


P(A,\C)  = 


P{A,.C)  SP(A2+B)  _  SP  ( A  2) 
P(C)  ~  SP{A+B )  SP ( A  ) 


Thus 
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P(AX\C,B{)  =  P(AX\C).  C  a  cut  (17) 

we  also  have 

P(AUBX\C)  -  P(A  )IC)/>(5,|C),  C  a  cut  (18) 

In  particular  if  C  “  jV(*.  then 

=P0\N‘),  BCN'  (19) 

V.  A  GENERAL  class  of  packet  length  distributions 

In  this  section  we  extend  the  results  just  proven  to  include  a  general  class  of 
distributions  for  the  packet  length.  We  will  show  that  the  procedures  developed  for  perfect 
capture  are  independent  of  packet  length  distribution.  To  prove  this  we  start  with  a  simpler 
extension.  In  the  above  we  assumed  that  all  packet  lengths  arc  exponentially  distributed  and 
have  the  same  mean  when  transmitted  by  a  node  to  any  of  its  neighbors.  Different  nodes  may 
transmit  different  average  length  packets,  however.  Now  assume  that  while  all  packet  lengths 
are  still  exponentially  distributed,  the  average  length  packet  transmitted  from  a  node  may  be 
different  to  each  of  its  neighbors.  This  will  be  useful  in  analyzing  different  protocols  (to  be 
presented  in  a  sequel  paper)  but  is  presented  here  as  the  first  step  in  the  desired  extension. 

Now  the  state  of  the  network  depends  upon  who  is  transmitting  and  to  whom.  We 
can  keep  the  same  structure  by  breaking  every  node  into  a  set  of  ’micronodes",  one  for  each 
neighbor.  These  nodes  may  be  indexed  by  (t  ,n, ).  If  i  is  transmitting  to  n,  then  this  node  is 
active,  otherwise  it  is  idle.  Micronodes  are  connected  in  our  topology  if  they  can  hear  each 
other.  Since  CSMA  still  prevails,  all  micronodes  for  a  given  node  are  fully  connected. 
Furthermore  all  micronodes  for  nodes  that  arc  connected  in  the  original  topology  are  also  fully 
connected  The  analysis  now  proceeds  as  above  since  the  Markovian  property  has  been 
maintained.  For  example,  equation  (14)  still  holds,  but  now  .V,  and  A*  are  collections  of 
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micronodes  Note  that  .V,  contains  the  full  set  of  micronodes  for  i  and  all  neighbors  of  i  S,  „ 
and  G,  „  have  the  same  meaning  as  before  However  they  are  normalized  by  1. /*»,.*,  the 

average  packet  length  for  packets  going  from  i  to  n, .  The  terms  in  the  sums  of  products  are 
G,  „  for  the  micronodes. 

Let  A  contain  full  sets  of  micronodes  and  include  the  node  i.  Then  from  equation 

(16) 


SP(A)  =SP \A-(i,n,)}  +  G,  „SP  (A  -  A", )  (20) 

Here  we  have  used  the  fact  that  iV(l  ™  Af, ,  and  the  notation  that  fV,  is  the  set  of  all 
micronodes  of  i  and  neighbors  of  i.  Since  this  is  equivalent  to  the  original  set  of  node 
neighbors  of  i  we  keep  the  same  notation.  Repeating  for  all  neighbors  of  i  we  get, 

SP(A ) -SP(A-i)  +  (2  Gt^SPiA-N.)  (21) 

" i 


If  we  let 


C, 


II 

M, 


2  G‘.n,  =  2  &.n/ 


C2) 


then  equation  (16)  is  preserved.  In  a  similar  manner  all  previously  derived  equations  can  be 
maintained  where  <7,  takes  on  the  definition  in  equation  (22).  Here  1/m,  is  an  average  packet 
length,  averaged  over  the  different  average  packet  lengths  to  different  neighbors  in  proportion 
to  their  scheduled  rate 

We  now  prove  our  main  theorem  for  this  section.  Assume  that  the  length  of 
packets  transmitted  from  i  to  n,  has  the  density 
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/,*<*)  2  ^iniJ  e 

j 


j’ 


(23) 


where 


0..v,  >  0  and  2  -  1 

; 

Thus  the  length  is  distributed  as  a  positive  sum  of  exponentials.  Another  way  of  looking  at  this 
is  that  the  a’s  are  the  probabilities  of  choosing  the  associated  exponential  density.  Now  create 
micronodes  for  each  triple  (i  ,n,  ,j).  Here  we  use 

=  a'.'WS'-V  Thc  micronodes  for  some  i  and  any  ntJ  are 

fully  connected  as  are  the  micronodes  for  neighboring  nodes.  Equations  (20)  and  (21)  now 
become 


SP(A)  ~SP\A  -  (i,n,,j))  +  Gi  niJSP(A-N,) 


and 


SP(A)  —  SP  (A  —i )  + 


H,J 


'2Gi.teP<<A-Ni) 


(24) 

(25) 


In  thc  same  manner  as  above,  we  let 


G‘’  —  - “v 

",0  "(V 


v 


V 
"  M 
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These  equations  are  used  to  find  P(A)  in  terms  of  <7,  and  are  identical  in  form  to  those  derived 
for  exponentially  distributed  packet  lengths. 


Equation  (5)  in  turn  comes  from 
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-S,.v,  -  Gi  ni  JP(\^.\n  ) 


Summing  ovcrj,  we  get  equation  (5) 


•S..,  “  2  “  C,  A/>(A’,+.Vni) 


Thus  all  relations  between  the  G’s  and  the  S’s  are  preserved.  The  actual  nature  of  the  problem 
is  taken  into  account  by  the  relationship  between  the  S's  and  the  normalization  by  the  m’s. 

We  can  now  restate  the  above  theorem.  Let  i  be  any  node  and  jtN‘,  the 
neighborhood  of  i.  Let  s,j  be  the  successful  (desired)  rate  from  i  to  j,  in  packets/sec.  Let  the 
density  of  the  packet  lengths  be 


fijix)  -  2  c^p  “'J*  twhere  niJJk  >  0,  0. 


and  2  a>j.k  “  1- 

k 


Let  Wnij  =  2  aij.k/Ptjji  be  the  average  packet  length,  in  seconds.  Let  Slv,  »  st  j/njj.  Then 

k 


P(N,\JNj)  —  a  function  of  (G \,G 2,  •  •  •  ) 


where 


c<  -  2 


Proof: 


et  Stj.k  “ 


fberi  5,  , .*  -  a,  .*  J, ,,  -  /» ( A’  U  A,  )g,  ,  > 
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where  G,  *  G:J  =  ^  w  * 

y«.\*  ;«>',*  * 

Dividing  5JJ>ik  by  m,  j.a  and  summing  over  k.  we  get 

S,J  «*OV,UJV,)G1(, 

VI.  EXAMPLES  OF  THE  PROCEDURE 

As  an  example  consider  the  chain  of  four  nodes  shown  in  Figure  2.  We  assume 
•Sf  1 2  “  -S' 21  “  >^23  “  -Sf 32  “  34  “  S43  “  5  for  simplicity,  and  perfect  capture.  Also  note  that 
G12  “  Gi,  G43  “  G4  and  by  symmetry  G  i  =  G4_  G2\  *=  G34,  and  (723  “  632-  Also  from 
equation  (6),  G 2  “  G 21  +  G 23  “  Gy  From  equation  (5),  we  have 

~  “  *(1.2,3)  -  =  *(1. 2.3,4) 

G 1  O21  O23 

But 

SP(N)  *  2  UIG,-)  "  1  +  <7,  +  (?2  +  G}  +  G4  +  G,G3  +  G2G4  +  G,G4 

liD 

=  1  +  2C,  +  2G:  +  2 G,G;  +  G,2  -  A 


and 


*(1, 2,3,4)  =  l/A,  P(  1,2,3)  =  (\+G4)/A  -  (1+G,)M 


Solving,  we  get 


C2  “  G|(2+Gl)  and  5  —  G|(  1+G,)/A 


or 


5  “  G  |(  1+G ! )/( 1 +6G  |+7(J  ,‘+2G  3 ). 


We  can  now  find  the  maximum  value  of  S  possible,  the  throughput  of  the  chain,  which  is  .128, 
obtained  when  G  |  “  0.71. 


In  general  the  equations  cannot  be  solved  as  simply  as  for  this  four  node  example. 
Equation  (11)  is  used  to  get  expressions  for  P(A)  in  terms  of  G,.  Then  equation  (14)  is  used 
to  get  expressions  for  S,  in  terms  of  C,-.»  and  P(A).  Equation  (6)  provides  the  relation 
between  G,  and  Gl><1(.  The  S, t„  are  found  from  the  offered  traffic,  the  routing,  and  other 
assumptions  and  are  considered  as  inputs.  The  equations  are  iterated  until  a  solution  of  G's  for 
a  set  of  S’s  is  found.  The  maximum  set  of  S's  possible  is  considered  the  throughput,  or 
capacity,  of  the  network.  For  some  modest  size  problems,  as  above,  the  equations  can  be 
solved  directly. 

As  a  second  example  consider  the  star  topology  shown  in  Figure  5.  Here  assume 
there  are  L  legs  of  N  —  2  nodes  each.  Denote  the  center  node  by  0,  the  nodes  one  hop  out  by 
1,  and  the  other  nodes  by  2.  Further  assume  symmetrical  traffic  in  the  nodes  and 
Sot  “  S\o  ”  S12  *■  S2,  “  5.  Then  LS  is  the  total  traffic  successfully  transmitted  by  node  0.  The 
equations  are 


-  P(all  but  L-\  §2  nodes) 
G 10 


SP(L~  1#  2  nodes) 
SP(  /V) 


-  ( 1+G2)£-1/A 


~p~  =  P(0, 1 ,2) 
G  2 


SP(L- 1  legs) 
SP(N) 


( 1+G  ,+G  2)i_l/A 


where 


A  —  SP(N)  *  (1+G,+G2)£  +  Go(1+G2)£ 

G  i  “  G  jo  +  G  12 

But  G|0  —  Gq/L  and  G,2  —  G2,  so  C|  “  G2  +  Gq/L.  Thus  we  have  two  equations 


LS/G0  -  ( 1+G2)i_l/A  and  S/G2  =  (l+2G2+go/L  )i_l/A 

For  any  5  <  5 aMt,  they  can  be  solved  for  Go  and  C2.  Alternatively  for  any  G0  we  can  find  the 
corresponding  G2  by  solving 

GoO+Gj/'1  -  G 2(l+2G2+Go/i.)i_l  . 

Then  the  relation  between  S  and  G2  can  be  studied.  We  then  find  the  maximum  S,  5 
possible.  LSmu  is  the  maximum  throughput  of  the  star. 

For  larger  problems  we  will  get  several  equations  of  the  form 


G0-  LSA/(1+G2)l-' 
G2-SA/(l+2G2+Go/Z.)i-1 


For  any  S  we  solve  these  iteratively.  Since  G02z  15,  G\  >  25,  and  G2>5,  the  lower 
bounds  are  good  starting  points  for  G, .  For  S  sufficiently  less  than  5m,,  we  have  found  the 
iteration  converges  monotonically  and  rapidly.  As  S  approaches  5^,,  from  below  the 
convergence  is  still  monotonic  but  slows  appreciably.  For  S  >  the  iteration  does  not 

converge  and  often  diverges  dramatically.  We  have  uncovered  no  serious  numerical  problems 
with  this  procedure  in  the  many  examples  we  have  evaluated. 

VII.  NUMERICAL  EXAMPLES 

We  consider  here  three  different  topological  structures  with  exponentially  distributed 
packet  lengths  and  perfect  capture.  We  assume  all  5,  „  —  5.  for  all  i,  and  take  full  advantage  of 

symmetry  The  three  topologies,  shown  in  Figures  3.  4,  and  5,  are  a  chain,  a  ring,  and  a  star, 
all  with  various  lengths.  In  each  case  we  find  the  maximum  throughput.  S  These  are  given  in 
Table  I. 

The  maximum  one  way  throughput  for  a  long  chain  is  S  =  .086.  This  throughput  is 
approached  when  the  lengths  of  chains  exceed  10.  For  smaller  length  chains  the  throughput  is 
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higher.  In  CSMA  transmissions  of  neighbors  may  not  overlap  in  time.  Since  each  nodes 
transmits  successfully  2 S  packets  per  average  packet  transmission  time,  then  we  must  have  S  < 
1/5.  The  throughput  is  slightly  sma.||4r  than  half  of  this  limit.  The  cause  of  the  reduction  is 
collisions  from  transmissions  two  hops  away,  the  so-called  ’hidden  terminals'.  The  throughput 
of  S  =  .086  for  a  chain,  although  the  maximum  possible,  is  not  a  useful  operating  point.  As  in 
ALOHA,  this  is  the  point  at  which  delays  become  infinite  and  the  system  is  unstable.  The 
network  would  have  to  be  operated  at  some  lower  level. 

It  is  instructive  to  compare  the  performance  of  multihop  CSMA  with  that  of  slotted 
ALOHA.  Let  p  be  the  probability  of  transmission  in  one  direction  at  a  node.  Then  S  =  p(  1  - 
2p)^  for  a  long  chain.  SmAl  here  is  .074  which  is  approximately  14%  less  than  that  for  CSMA. 
There  are  two  factors  working  here.  CSMA  will  produce  less  collisions  since  neighbors  will  not 
interfere  with  each  other.  Hidden  terminals  will  still  produce  collisions.  (All  terminals  are 
hidden  in  ALOHA).  But  CSMA  prohibits  possible  successful  transmissions.  For  instance, 
node  3  can  transmit  successfully  to  node  2  while  node  4  is  transmitting  successfully  to  node  5. 
This  is  possible  in  ALOHA  but  prohibited  in  CSMA.  This  is  one  of  the  prices  paid  to  control 
collisions. 

We  note  that  for  a  ring  greater  than  7  nodes  the  maximum  throughput  is  the  same 
as  that  for  a  long  chain.  This  is  expected  since  the  congestion  is  now  in  the  middle  and  it  is 
unimportant  whether  or  not  the  chain  is  closed.  A  star  with  two  legs  is  just  a  chain  with  N  for 
the  star  replaced  by  2N+  1  for  the  chain. 

Consider  the  star  configuration  as  representing  the  center  node  (0)  trying  to  transmit 
to  some  node  or  nodes  far  away  via  many  repeaters.  For  one  leg,  the  maximum  rate  is  .086. 
For  two  legs,  the  maximum  rate  is  2S  or  .172,  exactly  twice.  The  results  are  shown  in  Table  II 
where  the  throughput  of  the  center  node  is  given  by  LS.  We  see  from  Table  II  that  whereas 
the  throughput  doubles  for  L  =  2,  it  increases  only  by  20%  when  L=3,  by  4-5%  further  when 
L”4,  and  by  2%  when  L“5.  Congestion  at  the  central  node  is  limiting  its  ability  to  increase  its 
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throughput.  Additional  legs,  beyond  three,  are  not  really  helpful 

We  have  investigated  ways  of  reducing  the  congestion  at  the  center  node.  For  larger 
L.  the  traffic  in  each  leg  is  limited  by  congestion  at  the  center.  The  collisions  that  cause  most 
problems  are  for  transmissions  from  the  first  level  of  nodes  ( 1 ,  N  +  I ,  2N+  1 ,...,  (L-l )N+  1 )  to 
the  center.  These  are  collided  with  by  other  first  level  nodes.  To  reduce  these  collisions  we 
considered  connecting  the  first  level  nodes  in  a  ring  and  then  fully  connecting  them.  These 
results  are  also  summarized  in  Table  I.  When  the  first  level  is  unconnected  the  throughput 
saturates  at  .229.  When  the  first  level  is  fully  connected  the  throughput  with  9  legs  is  .252,  a 
15%  increase.  For  four  legs,  the  ring  connected  topology  is  best,  providing  some  compromise 
between  reducing  collisions  and  allowing  simultaneous  transmissions. 

The  best  that  we  can  expect  in  the  fully  connected  case  is  LS  <  1/3.  This  is 
because  all  transmissions  from  the  center  node  and  all  first  level  nodes  cannot  overlap.  We  will 


discuss  asymptotic  results  with  even  larger  stars  and  chains  in  the  next  section. 


VIII.  ASYMPTOTIC  results 


We  are  interested  in  asymptotic  results  for  several  reasons.  They  provide  us  with 
the  limiting  behavior  of  the  finite  networks  previously  studied.  Since  the  behavior  of  tbesr 
networks  seems  to  converge  rapidly  with  their  size,  if  asymptotic  results  are  easier  to  obtain, 
they  would  be  useful.  We  are  also  interested  in  very  large  networks.  A  final  reason  is  to  verify 
some  of  the  bounding  arguments  on  throughput  made  in  the  last  section. 

We  first  consider  an  infinitely  long  chain.  We  let  5,  ,+1  =  S,  ,-i  =  5  Then  all  nodes 


are  identical.  Also  =  G,  ,_|  =  GJ2.  Thus  with  G,  =  G , 


,.  m. ,«>  - 

G,  G  SP(-o o oo ) 


We  can  write  the  denominator  as 
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SP(— oo,  ...,oo)=S/>(—  co,  /  —  1  )SP(i  +  1  ,...,co)+G,SP  (  —  oo /—  2)SP(i+2, 


x,  ,  ^  SP(i—k oo)  SP{— co,. _  , 

Now  let  Qk  -  — e— - -  — ±- - — VVc  observe  that  Qk  >  1 

Sr(i oo)  SP{—  oc,...,i) 

converges  is  independent  of  i.  Then  equation  (29)  becomes 


25  ...  1 

G  ~  QiQi+GQ, 


But 


0* 


SP  (-00 . r+frjH}  +  G  SP(  -oo  -2) 

SP (  co, . . . ) 


0*-,  +  C0*-2 


and  0o“  I  Thus  —  gj  +  G .  Therefore 


2S/G  - 


1 


Gi«?i  +  20 


or 


20,(0, +2G) 


We  note  that  since  £)_ ,  =»  — — ,  from  equation  (31)  we  have 


Fi nail v  we  have 


0,  «0o  +  ~f-  =  1  +  “  or 
Q  i  Q  i 

c  -  0,(0, -l) 


0-1 


20,  (20,-1) 


.0,  2:  1 


The  maximum  value  of  S  is  ,086,  reached  when  Qt  *  1.7  or  G  =  1.2. 


.oo). 

for  A:  >0  and  if  it 

(30) 

(31) 


(32) 

(33) 

(34) 


IX.  EXTENSIONS  AND  CONCLUSIONS 


We  have  presented  a  simple  but  fairly  realistic  model  of  a  multihop  packet  radio 
network  and  have  obtained  maximum  throughputs  for  general  topologies  and  packet  lengths 
We  have  assumed  perfect  reception  of  acknowledgments  and  have  not  included  additional  traffic 
due  to  end-to-end  acknowledgments.  Some  aspects  of  acknowledgments  can  be  included  by 
increasing  the  required  traffic.  We  are  investigating  the  effect  of  imperfect  acknowledgments 
and  different  retransmission  strategies.  The  model  should  still  be  useful  under  these 


extensions. 
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ABSTRACT 

We  consider  the  problem  of  obtaining  exart 
expressions  for  throughput  and  blocking  proba¬ 
bilities  in  multihop  packet  ratio  networks  operat¬ 
ing  under  CSMA.  We  obtain  exact  results  for 
exponential  message  lengths  lor  general  topologies 
and  for  constant  packet  lengths  for  Simple  topol¬ 
ogies.  Both  results  are  obtained  by  assuming 
perfect  acknowledgments . 


I.  INTRODUCTION 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  proba¬ 
bilities  in  multihop  packet  radio  networks  operat¬ 
ing  under  carrier  sense  multiple  access  (CSMA). 
By  considering  exponential  packet  lengths,  proce¬ 
dures  are  developed  which  can  be  used  to  analyze 
general  topologies.  Examples  of  chains,  rings, 
and  stars  are  presented.  Extensions  to  non¬ 
exponential  and  fixed  packet  lengths,  and  more 
complex  models  are  discussed. 


II.  THE  NETWORK  MODEL 

We  consider  the  problem  of  analyzing  the 
throughput  capability  of  a  multihop  packet  radio 
network  operating  under  carrier  sense  multiple 
access  (CSMA).  Thus,  we  assume  that  the  net¬ 
work  is  comprised  of  terminals  equipped  with 
radio  transponders  suitable  for  broadcasting  data 
over  a  limited  distance.  In  general,  the  source 
and  destination  terminals  cannot  hear  each  other 
directly,  and  the  data  has  to  be  relayed  by  one 
or  more  intermediate  devices.  A  separate  set  of 
devices,  called  repealers,  may  exist  for  this 
purpose,  or  the  terminals  themselves  may  relay 
messages  for  one  another. 

Control  of  the  network  is  completely  distri¬ 
buted.  i  e  ,  no  station  or  centra)  control  mechan¬ 
ism  is  assumed  to  exist.  Rather,  we  assume  that 
each  source  terminal  has  prestored  one  or  more 
routes  to  all  destinations  and  includes  all  neces¬ 
sary  routing  information  in  the  packets  it  trans¬ 
mits.  These  assumptions  are  made  merely  to 
simplify  the  presentation.  In  tact,  the  results 
presented  are  valid  for  networks  using  alternate 
routing  as  long  as  routing  changes  are  not  made 
over  short  lime  intervals.  One  of  the  motivations 
for  this  study  came  from  a  consideration  of  the 


design  of  routing  procedures  for  such  networks 
It  was  nc  essary,  however,  to  first  develop  an 
understanding  for  the  throughput  of  various 
topologies 

Exogenous  traffic  is  modeled  as  independent 
Poisson  processes  arriving  at  each  sourre  nodi, 
wuh  appropriate  rates  end,  initially,  exponentially 
distributed  packet  lengths.  The  topology  is 
specified  by  a  listing  of  which  terminals  <ot 
repeaters)  can  hear  each  other.  In  ihe  remainder 
we  will  not  distinguish  between  terminals  and  re¬ 
peaters  and  will  refer  to  them  collectively  as 
either  terminals  <.  nodes.  In  general  the  trans¬ 
missions  of  one  terminal  can  be  heard  by  many 
other  terminals.  The  routing  will  specify  which 
terminal  is  to  repeat  the  packet,  if  necessary. 

If  two  or  more  transmissions  are  simuliar.- 
eously  heard  by  a  terminal  (calied  a  "collision'’) 
at  least  one,  and  possibly  both,  is  "lost"  and 
must  be  retransmitted.  We  assume  retransmis¬ 
sions  are  scheduled  at  a  random  instant  in  time 
sufficiently  far  in  the  future  so  as  to  preserve 
the  Poisson  nature  of  the  combined  traffic  stream, 
which  now  consists  of  exogenous  traffic  and 
rescheduled  traffic.  For  this  study,  we  assume 
that  a  pac Vet  can  be  retransmitted  as  many  times 
as  is  nee*..',  -.ary,  i.e.,  that  there  is  no  maximum 
allowable  i  mber  of  retransmissions. 

Terminals  may  either  transmit  or  receive, 
they  canr.ot  do  both  simultaneously.  Belore 
transmitting,  a  terminal  senses  the  channel.  I!  it 
detects  that  any  of  its  neighbors  (i.e.,  terminals 
that  it  can  hear)  are  transmitting  (by,  e  q.. 
sensing  a  carrier;  it  reschedules  the  transmission 
as  for  collided  packets  above.  If  at  the  sched¬ 
uled  time  for  a  transmission,  the  terminal  is 
already  engaged  in  transmitting  a  packet,  the  new 
packet  is  also  rescheduled  as  above.  Thus  pack¬ 
ets  are  continually  rescheduled  until  they  ire 
successfully  delivered  to  the  next  terminal  on 
their  route.  We  assume  that  the  total  stream  <>f 
traffic  scheduled  by  any  terminal  is  a  Poisson 
process.  Thus  includes  originating  trallic  and 
packets  rescheduled  either  due  to  collisions  or 
due  to  the  channel  having  been  sensed  busy 
This  scheme  is  called  carrier  sense  multiple  access 
(CSMA).  The  Poisson  assumption  is  valid  lor  the 
assumptions  made  above  and  will  yield  accurate 
results  for  throughput  Compromises  will  have  to 
be  made,  however  if  ,n  accurate  picture  of  time 
delays  is  to  be  considered 
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it  is  possible,  doe  to  non-zero  propagation 
delay,  that  collisions  o(  transmissions  from  neiqh- 
nowng  terminals  may  still  take  place  despite  the 
i'SMA  strategy  This  will  occur  if  a  teimmai  senses 
the  channel  before  another  terminal's  transmission  is 
received.  This  effect  is  small  if  terminals  are 
reasonably  close  or  are  not  transmitting  at  high 
speed  We  will  ignore  this  phenomena  here,  and 
assume  that  all  transmissions  are  instantly  heard  by 
their  neighbors. 

A  passive  acknowledgment  is  used  for  trans¬ 
mission  to  neighbors.  The  transmitting  terminal 
listens  to  the  channel  to  hear  if  a  packet  is  being 
rebi oadcast  by  a  neighbor.  If  after  a  prespecified 
time  interval,  the  transmitting  node  does  not  hear 
the  packet  rebroadcast,  it  retransmits  the  packet. 
But  the  packet  may  have  been  successfully  received 
by  the  neighbor  even  though  the  originator  does 
not  hear  the  rebroadcast.  We  assume  that  duplicate 
packets  are  transmitted  and  are  deleted  only  at  the 
final  destination  We  assume  an  end-to-end  acknowl¬ 
edgment  is  returned  to  the  originator  from  the  final 
destination.  In  this  paper  we  assume  that  passive 
acknowledgments  are  always  heard  and  ignore  the 
effect  of  end-to-end  acknowledgments.  Alternately, 
these  acknowledgements  could  have  been  added  lo 
the  required  traffic. 

We  depict  the  topology  of  the  network  by  a 
graph  where  terminal*  are  represented  by  nodes. 
The  nodes  are  connected  by  a  link  if  they  can  hear 
each  other's  transmissions ,  i.e.,  if  they  are  neigh¬ 
bors  As  an  example  see  figure  1.  Node  A  can 
hear  node  B.  but  not  node  C  Node  B  can  hear 
both  noaes  A  and  C.  Node  C  can  hear  node  B, 
but  not  node  A  If  node  A  is  transmitting  to  node 
B  and  node  C  begins  transmitting,  then  the  trans¬ 
mission  from  A  to  B  may  be  lost  depending  upon 
the  "capture"  assumptions  we  make.  A  conservative 
assumption  is  that  the  A  to  B  transmission  is  lost  - 
this  is  known  as  zero  capture.  Alternately,  perfect 
capture  assumes  that  this  transmission  is  success¬ 
fully  received.  Half-amplitude  capture  assumes  that 
the  transmission  is  lost  if  C  dominates  A  at  B. 
This  can  happen  if  C  is  closer  lo  B  than  A  is  to  B , 
or  hjs  a  greater  signal  strength  perceived  at  B 
than  A  has  If  A  dominates  C,  then  the  transmis¬ 
sion  is  successful.  We  will  consider  these  different 
capture  situations  below.  Note  that  under  C.SMA  if 
node  B  is  transmitting,  neither  A  or  C  is  allowed  to 
transmit. 

We  assume  that  a  routing  has  been  specified. 
I  his  takes  the  form  of  deciding  which  of  the  neigh¬ 
bors  are  to  rebroaacast  a  packet  from  a  particular 
source  to  a  particular  destination.  Thus  the  amount 
of  traffic  that  a  terminal  wishes  to  send  to  its 
neighbor  can  be  computed.  If  these  rebroadcast 
pa.  ki-ts  arc  scheduled  at  a  random  time  far  in  tne 
! i  ut  c  the  I ’oi s son  assumption  (or  traf'ic  stream:  is 
l  reserved .  Vie  assume  that  the  tiaf'ic  between 
neighbors  is  specified  and  form  independent  Poisson 
pro,  i'S',es  When  considering  exponentially  disbri- 
butid  packet  lengths  we  assume  that  the  packet 
l.-ilth  is  reassigned  independently  at  each  hop 
Alihu.jgh  this  leads  to  some  anamolies  in  interpret¬ 


ing  the  results  it  is  a  key  assumption  in  th< 
model  and  analogous  to  the  "independence 

assumption"  in  queuing  networks 

The  details  ol  CSMA  for  a  single  hop  net¬ 
work  can  he  found  in  the  papers  by  Tohagi  arid 

Kleinrock'^  Tobagi  has  also  developed  some 

simple  models  for  two-hop  networks'  .  Details 
o'  a  packet  radio  network  ran  be  found  in  a 

paper  by  Kahn1'^  A  discussion  of  routing  i ri 
multibup  packet  radio  ran  be  found  in  the  paper 

by  Gitman.  Van  Slyke,  and  Frank^. 

III.  GENF.RAL  RESULTS 

In  this  section  we  develop  some  expressions 
that  are  valid  for  the  packet  radio  network  we 
have  modelled  above  using  CSMA  and  with  an 
arbitrary  packet  length  distribution.  Let  i  be  a 
node,  n(  one  of  its  neighbors,  N(*  the  set  of  all 

the  neighbors  of  i,  and  the  set  of  all  i's 

neighbors  and  including  i.  Without  loss  of  gene¬ 
rality  assume  the  average  packet  length  at  each 
terminal  is  unity.  Let  G  be  the  total  rate  (in 
packets/sec)  of  all  scheduled  traffic  at  node  i. 
This  includes  originating  traffic  and  all  resched¬ 
uled  traffic  and  is  assumed  to  be  Poisson. 

Node  i  is  either  busy  (transmitting)  or  idle. 
It  will  transmit  a  scheduled  packet  if  at  the 
instant  >t  is  scheduled  all  nodes  in  N(  are  idle. 

Let  A  be  a  set  of  nodes.  Let  P(  A)  be  the  proba¬ 
bility  thai  at  a  random  instant  all  nodes  in  A  are 
idle.  Tne  nodes  not  in  A  may  or  may  not  be 
idle.  Similarly  P(i),  P(i,A),  P(A,B)  are  the 
probabil.ties  that  i  is  idle,  i  and  nodes  in  A  are 
idle,  and  all  nodes  in  A  and  B  are  idle. 

Since  iraffic  is  scheduled  at  node  i  with  a 
Poisson  rate  G  ,  will  be  transmitted  only  if  N  is 

i  i 

idle,  and  transmissions  have  unity  average 
length,  the  probability  that  i  is  busy  is  given  by 

l-P(i)  =  G|P(N-)  (1) 

If  i  is  busy  then  under  CSMA,  n(  must  be  idle. 
Then  since 

Pfn^  =  P(n..i)  ♦  P(n(|  i  busy)  (l-P(i)] 
and  P( n  f  |  i  busy)  =  1.  we  have 

P(n.  ,i )  -  P(n()  ♦  P(i)  -  1  (2) 

Similarly  .  if  A  ”  N  *  , 

P(A.i)  -  P(  A )  ♦  P(i)  -  1  O) 

letting  A  -  N  *  in  equation  (3),  and  using 


P(N4*)  =  P(Nj)/P(i  |Nj*) 


we  get 

PUINi*)--!^  (4’ 

Equation  (4)  is  often  found  in  CSMA  literature. 

A  packet  from  i  to  n(  will  be  transmitted 
when  it  is  scheduled  if  N;  is  idle.  During  the 
transmission  all  nodes  in  N.*  will  be  idle.  It  will 
be  successfully  received  at  n(  if  (1)  all  neighbors 
of  not  in  hr  are  idle  at  the  beginning  of  trans¬ 
mission  and  (2)  depending  upon  the  capture 
assumptions,  none  of  these  nodes  begin  transmit¬ 
ting  until  i's  transmission  is  ended.  Otherwise  a 

collision  will  occur.  Let  S.  be  the  rate  in 

i.n. 

packets/sec,  determined  by  the  routing  and  as¬ 
sumed  Poisson,  of  the  traffic  that  i  wishes  to 
send  to  n^.  This  is  the  required  throughput  or 

offered  traffic.  Let  G.  _  be  the  rate  of  all 

i,ni 

scheduled  traffic  from  i  to  n^  We  have  also 

assumed  that  all  these  streams  are  Poisson  and 
independent.  Of  these  G;  R  packets  (per  sec¬ 
ond),  S.  must  be  successful.  Thus 
1 ,  n 


Markov  process  where  Ihe  stales  are  identified  by 
which  nodes  are  idle  and  which  are  busy.  Let  D 
be  a  set  of  busy  nodes.  because  of  CSMA,  no 
nodes  in  [>  may  be  neighbors  of  each  other.  We 
shall  refer  to  such  a  set  of  non-neighbors  (or 
strangers)  as  being  "strange  "  Let  Q(D)  be  the 
probability  that  at  an  instant  of  time,  all  nodes  in 
D.  which  is  strange,  are  busy,  and  all  nodes  not 
in  D,  are  idle.  Then  each  strange  set,  D,  repre¬ 
sents  a  state  in  a  Markov  system,  and  0(D)  is 
the  stale  probability.  The  collection.  D,  of  all 
strange  sets.  £,  represents  all  the  states  in  the 
system.  In  particular,  the  null  set  D  =  ♦,  re¬ 
presents  the  state  that  all  nodes  are  idle. 

Assume  the  system  is  in  state  D.  It  will 
leave  the  state  if  any  icD  stops  transmitting. 
This  happens  with  rate  =  1 .  Thus  the  transi¬ 
tion  to  state  { D-i  J  occurs  with  rate  pj.  The  only 

other  way  to  leave  state  D  is  for  one  of  the  idle 
nodes  that  is  a  stranger  to  all  icD  to  begin  to 
transmit.  This  occurs  with  rate  A^  =  Gj.  (We 

use  Mj  and  A;  for  the  moment  for  clarity).  Let 
be  the  set  of  all  neighbors  of  all  nodes  in  D. 
Then  the  transition  from  D  to  |D+j),  j^Np,  occurs 
with  rate  A..  The  global  balance  equations  for 
this  system  are 


5— 1  =  P(N{.Nn  )P  (X)  (5) 

i.Hj  i 

Let  D.  _  be  the  nodes  in  N  that  dominate  i  at 

'•ni  ni 

0^  Then  P(X)  is  the  probability  that  no  nodes 

in  Dj  n  begin  transmission  during  the  transmis¬ 
sion  of'  the  packet  from  i  to  nj  conditioned  on 
node  t  transmitting,  all  nodes  in  N.*  being  idle 

throughout  the  transmission,  and  all  nodes  in 
Nn  being  idle,  at  least,  initially.  If  D.  n  is 

empty  or  perfect  capture  is  assumed  then  P<  X  )=1 ! 

The  total  scheduled  traffic  at  a  node  is 
given  by 

Gj  1  !  <6> 

n.r.N.a  1 


(iMj  ♦  IA}  )  Q(D)=  iAs  Q(D-i)+  l  Mj  Q(D*j)  (7) 
icD  j*ND  icD  j<ND 

where  DcM  the  collection  of  all  strange  sets. 

It  is  easy  to  see  that  these  equations  are 
satisfied  by 


Q(D)  =  -  Q(D-i)  =  GjQ(D-i),  icD  (8) 

Mi 


Q(D)  =  (  n  Gj)  <?(♦)  (9) 

itD  1 

where  we  adopt  the  convention  that  n  G|  =  1 
Summing  over  all  Dcj&,  we  get  it* 


l  Q(D) 
Dt  D 


I  (n  q.)  Q(»)  =  1 

D lD  icD 


Prom  equations  (1)  through  (6)  we  wish  to  derive 

a  relation  between  the  S  and  G. ,  and  deter- 

i ,  n  j  i 

mine  the  maximum  S  the  network  can  support 

'•ni 

This  we  call  the  (maximum)  throughput  or  capa¬ 
city  In  the  next  section  we  develop  this  rela¬ 
tionship  for  exponential  packet  lengths  and  arbi¬ 
trary  topoloqies  Later  we  present  some  results 
lor  non-exponential  lengths  and  simple  topologies. 

IV  EXPONENTIALLY  DISTRIBUTED  PACKET 
LENGTHS 

If  the  packet  lengths  are  exponentially 
distributed,  then  the  system  can  be  viewed  as  a 


In  the  previous  section  we  found  we  were 
interested  in  quantities  like  P(A),  where  A  is  any 
set  of  nodes,  and  P(A>  is  the  probability  that  all 
nodes  in  A  are  idle,  and  all  nodes  not  in  A  may 
or  may  not  be  idle  This  can  be  found  by  sum¬ 
ming  O(D)  over  all  sets  L>  that  do  not  contain 
nodes  in  A  Thus 


P( A )  =  n  0(D)  = 
D r  AL 


l  r  (n  G,) 
PCAc  icD 

i  (n  G^ 

DC^  icD 
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uticr'  I),  A  refers  lo  ail  strange  sets  contained  in 
the  ■  omplemonl  of  A  We  adopt  the  shorthand 
ru.t.ition , 

SP(B)  =  l  (fl  G.  )  (12) 

DCBitD  1 


Equations  (II)  and  (S)  can  be  used  for  any 
topology  to  generate  the  solution  to  our  problem 
I  he  equations  i  elating  the  n  and,  G  can  be 

solved  iteratively,  for  example,1  equation  (5)  now 
becomes 


where  SP  refers  to  sum  of  products.  Thus 

P(  A )  =  SP(AC)/SP(~)  (13) 

where  is  the  set  of  all  nodes. 


SP(|N>Nn  ))c 
~5PW' - 


P(X) 


(14) 


As  an  example  consider  the  chain  of  four 


We  assume  Sj2=  S2 j = 


nodes  shown  in  figure  2 
S.,3=  S32=  S34=  S^3=  S  for  simplicity,  and  perfect 
capture. 


G12=  Gj 

•  G43= 

1=  G34  ■ 

and  G. 

'2~  G21  + 

G23=G‘ 

eguation  (5),  we  have 


But 


J  =  P(1 ,2,3)  =  J-, 
G1  G21 


^23 


23‘  32' 


P(l. 2,3,4) 


SP(»)=  I  (riGj)=l+G1eG2+G3«G4+G1G3eG2G4+G1G4 
DcftcD 


-  1  +  2Gj  +  2G2+  2GjG2*  Gj-  & 


and 

P(  1,2, 3,4)=] /A,  P(l,2,3)=(leG4)/a=(l+Gj)/A. 
Solving,  we  get 

G2  =  Gj(2+Gj)  and  S  =  Gjd+Gj)/*  . 
or 

S  =  Gj  ( 1  +  Gj )/( 1  +  6Gj»  7G]*+  2Gj3). 

W>  can  now  find  the  maximum  value  of  S  possible, 
the  throughput  of  the  chain,  which  is  .128, 
obtained  when  Gj  =  0.71. 

We  can  also  consider  half-amplitude  or  zero 
capture.  Tor  example,  consider  zero  capture. 
Then  when  2  is  transmitting  to  3,  4  can  inter¬ 
fere  But  1,  2  and  3  are  idle,  and  4  is  initially 
idle  The  transmission  is  successful  if  2  finishes 
transmission  before  4  starts  But  these  are 
events  occurring  in  exponential  time. 


where  by  A+B  we  mean  the  union  of  A  and  B. 
For  perfect  capture  P(X)=1.  For  other  capture 
modes,  the  computations  for  P(X)  are  straight¬ 
forward  but  can  become  tedious  for  complex 
topologies. 

Evaluation  of  sums  of  products  in  equation 
(12)  are  made  easier  by  the  following  two  rules. 
Consider  two  sets  of  nodes  A  and  B  such  that  no 
node  in  A  can  hear  any  node  of  B.  These  sets 
may  be  called  non-communicating.  Then 


SP(A+B)  =  SP(A)  SP(B), 

A&B  non-communicating  (IS) 

Also, 

SP(A)  =  SP(A-i)  ♦  GjSF(A-Nj),  ieA  (16) 

To  prove  these  rules  just  consider  all  products. 
We  have  successfully  evaluated  many  complex 
topologies  with  these  procedures. 

There  are  other  relations  which  will  be  found 
useful  in  extending  our  model  to  more  complex 
situations.  We  prove  some  of  these  below.  Let  C 
be  a  cut,  i.e.,  a  set  of  nodes  that  divides  the 
network  into  three  parts  A,B,  and  C,  where  A 
and  B  are  non-communicating.  Let  A=A.+  A?,  B 
=  B  j+  B2  where  Aj  A2=  Bj  B2r4-  Then 


P( A. ,C, B. )  SP(A  +B.,)  SP(A-) 
P(Aj  |C,Bj  )-  pfc.fej)  =  ‘SP(A7gp=  'S'RaT 

But 

P(Aj,C)  SP(A,+B)  SP(A2) 
P(AllC)  1  P(C)  =  SP( A+B)  =  SP(A) 


Thus  P(Aj|C,Bj)  =  P(Aj|C),  C  a  cut  (17) 


Thus  P'X)  - 


1 +G , 


by  symmetry .  Now 

1  Uj  , 


r  P(l,2,3,4)  — ri7.  •  — 5-».  is  unchanged. 
r,23  1  G4  G21 

C 

is  a  little  harder  to  obtain,  but  is  a  straight- 

1 

forward  calcuiation. 


We  also  have 


P( A, ,  Bj|C) 
in  particular 


=  P(Aj|C)  P(BjjC).  C  a  cut 
if  C  =  N.* ,  then 


(18) 
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Ft  1 1  N(* ,  B)  fH  i  |  N  ( * ) ,  BG  Nc  ay) 

V  Fx.imples 

We  consider  here  three  different  topological 
structures  with  exponentially  distributed  packet 
lengths  and  perfect  Cdpture.  Vie  assume  all 

b  S,  tor  ail  i.  and  take  lull  advantage  of 
i.n, 

symmetry  The  three  topologies,  shown  in  fig¬ 

ures  3.4,  and  5,  are  a  chain,  a  ring,  and  a  star, 
,iii  with  various  lengths.  In  each  case  we  find 

the  maximum  throughput,  S.  These  are  given  in 
"I  able  I . 

Note  that  for  a  ring  greater  than  7  nodes 
the  maximum  throughput  is  the  same  as  that  for  a 
lonq  chain  This  is  expected  since  the  congestion 
is  now  in  the  middle  and  it  is  unimportant  whe¬ 
ther  or  not  the  chain  is  closed.  A  star  with  two 
legs  is  )ust  a  chain  with  N  for  the  star  replaced 
by  2NM  for  the  chain  Consider  the  star  config¬ 
uration  as  representing  the  center  node  (0) 

trying  to  transmit  to  some  node  or  nodes  far  away 
via  many  repeaters  for  one  leg,  the  maximum 
rate  is  086.  For  two  legs,  the  maximum  rate  is 
2S  or  .172,  exactly  twice.  The  results  are  shown 
in  Table  II  where  the  throughput  of  the  center 
node  is  given  by  LS  We  see  from  Table  II  that 
whereas  the  throughput  doubles  for  L  =  2,  it 
increases  only  by  20*1,  when  L=3,  by  4-5S  further 
when  1.-4 ,  and  by  2%  when  L-5  Congestion  at 
the  central  node  is  limiting  its  ability  to  increase 
its  throughput.  Additional  legs,  beyond  three, 
are  not  really  helpful  We  are  evaluating  other 
topologies  to  find  ways  to  increase  the  throughput 
of  a  central  node  in  such  a  configuration. 

VI  Non-exponential  Packet  Lengths 

For  non-exponential  packet  lengths,  equa¬ 
tions  ( ’.  )  through  (6)  are  still  valid  but  are 

insufficient  to  solve  for  the  G,  and  the  S,  .  For 

l  i ,  n; 

a  three  node  chain  and  a  star  with  any  number  of 
legs  but  N  =  1  we  have  been  able  to  prove  equa¬ 
tion  (IS)  for  arbitrary  packet  length  distribu- 
t.ons  The  equations  for  !'<  A 1  are  identical  in 
form  as  for  exponential  lengths  but  the  terms 
th  X  )  tor  half-amplitude  and  zero  capture  are 
di' let  ftit .  For  example,  lor  constant  packet 

lengths  .nstead  ol  1/d‘Gi  we  now  have  e 
'!  nese  two  simple  topologies  can  be  completely 
analyzed  tor  any  capture  mode.  Further  details 
w  a  not  be  given  here . 

V..'  have  not  been  able  to  derive  a  similar 
r  i  ,tior.‘  hip  lor  other  topologies.  However,  we 

that  equations  (17)  through  d9)  are  tempting 
ipnroximalion;.  to  try  Then  if  together  with 
c  ;  .at i.  r«  ■  1  )  through  (»>  tte-y  are  sullirient  to 
equine;:',  to  tied  the  throughput, 
tf"  resulting  approximate  solution  has  some  ap- 
we  have  been  able  lo  do  this  for  several 
••erg  ,i  t  ipdo.qies  but  have  not  verified  the  ac- 
■  ..:  i  '.  --.!  trie  apnrnxim.i! ion  Consideration  of 

.  •••  i  *  pi  i  led  rapture  in  these  cases  becomes 

let)  .1 .  t  t  l<  u  i  l 


VII  extensions  and  Conclusions 


We  have  presented  a  simple  but  burly  realis¬ 
tic  model  ot  a  multihop  packet  radio  network  and 
have  obtained  maximum  tin cuglipi.t;;  dr  general 
topologies  and  exponential  message  lengths.  The 
procedures  are  easiest  to  implement  when  perfect 
capture  is  assumed  Other  capture  modes  can  be 
handled  in  a  straightforward  but  somewhat  more 
tedious  manner.  We  have  assumed  perfect  recep¬ 
tion  of  acknowledgments  and  have  not  included 
additional  traffic  due  to  end-to-end  acknowledg¬ 
ments  Some  aspects  ol  acknowledgments  can  be 
included  by  increasing  the  required  traffic.  We 
are  investigating  the  effect  of  imperfect  acknowl¬ 
edgments  and  different  reiransmisslon  strategies. 
The  model  should  still  be  useful  under  these 
extensions. 

We  have  been  able  to  derive  results  for 
non-exponential  packet  lengths,  but  only  for 
simple  topologies.  Our  procedure  suggests  a 
simple,  and  probably  good,  approximation  to  be 
used  for  more  complex  topologies.  However, 
imperfect  capture  becomes  a  more  serious  prob¬ 
lem.  We  are  investigating  these  extensions. 

Finally,  we  are  applying  these  techniques  to 
evaluate  different  routing  strategies  and  trans¬ 
mission  schemes  in  a  muitihop  packet  radio  net¬ 
work. 
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Table  I  Maximum  throughput  (S) 


N umber 
of 

Nodes,  N 

Chain 

Star 

Ring  L=2  L^3  L=4  L=5 

1  1 

.000 

1  000  .167  .103  .074  .058 

T 

900 

.500  .111  .076  .057 

3 

.167 

.167  .097  .072  .055 

4 

.128 

.073  .092  .070  .054 

5 

111 

.100  .069  .054  .044 

6 

.102 

.083 

7 

.097 

.087 

8 

.094 

.085 

9 

.092 

.086 

10 

.091 

.086 

ac 

.086 

.086  .086  .069  .054  .044 

Table  II.  Maximum  Throughput  of  the  Central 
Node  in  a  Large  Star  Arrangement. 


Number  of  Legs 

Maximum  Throughput 

1 

.086 

2 

.172 

3 

.207 

4 

.216 

5 

.220 

Figure  5.  A  star  network  (L  legs) 


O - O - O - O 

12  3  4 


Figure  2.  A  four  node  chain 


Figure  4.  A  ring 
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Abstract 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  probabil¬ 
ities  in  multihop  packet  radio  networks  operating 
under  CSMA.  We  have  obtained  exact  results  for 
a  general  class  of  message  lengths,  for  general 
topologies,  and  for  perfect  capture.  These 
results  were  obtained  by  assuming  perfect  ack¬ 
nowledgments.  Here,  we  extend  these  results  to 
include  situations  where  acknowledgments  are  not 
always  heard.  We  also  introduce  and  analyze  a 
new  acknowledgment  protocol  that  significantly 
improves  performance. 


I.  Introduction 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  probabil¬ 
ities  in  multihop  packet  radio  networks  operating 
under  carrier  sense  multiple  access  (CSMA). 
Procedures  have  been  developed  to  analyze  gen¬ 
eral  topologies  for  a  general  class  of  packet 

length  distribution  and  perfect  capture.4  That 
work  assumed  perfct  acknowledgements .  Here, 
we  extend  these  results  to  include  situations 
where  acknowledgments  are  not  always  heard.  We 
also  introduce  and  analyze  a  new  acknowledgment 
pfotocol  that  significantly  improves  performance. 

II.  The  Network  Model 

We  consider  the  problem  of  analyzing  the 
throughput  capabilty  of  a  multihop  packet  radio 
network  operating  under  carrier  sense  multiple 
access  (CSMA).  Thus,  we  assume  that  the  net¬ 
work  is  comprised  of  terminals  equipped  with 
radio  transponders  suitable  for  broadcasting  data 
over  a  limited  distance  In  genera!,  the  source 
and  destination  terminals  cannot  hear  each  other 
directly,  and  thus  data  has  to  be  relayed  by  one 
or  more  intermediate  devices  A  separate  set  o! 


*  This  research  was  partially  supported  by 
USARMY  CENCOMS  under  contract  DA  AK 
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devices,  called  repeaters,  may  exist  for  this  pur¬ 
pose,  or  the  terminals  themselves  may  relay  mes¬ 
sages  for  one  another. 

Control  of  the  network  is  completely  distrib¬ 
uted,  i.e.,  no  station  or  central  control  mech¬ 
anism  is  assumed  to  exist.  Rather,  we  assume 
that  each  source  terminal  has  prestored  one  or 
more  routes  to  all  destinations  and  includes  all 
necessary  routing  information  in  the  packets  it 
transmits.  These  assumptions  are  made  merely  to 
simplify  the  presentation.  In  fact,  the  results 
presented  are  valid  for  networks  using  alternate 
routing  as  long  as  routing  changes  are  not  made 
over  short  time  intervals. 

Exogenous  traffic  is  modeled  as  independent 
Poisson  processes  arriving  at  each  source  node, 
with  appropriate  rates  and  packet  lengths.  The 
topology  is  specified  by  a  listing  of  which  termi¬ 
nals  (or  repeaters)  can  hear  each  other.  In  the 
remainder  we  will  not  distinguish  between  termi¬ 
nals  and  repeaters  and  will  refer  to  them  col¬ 
lectively  as  either  terminals  or  nodes.  In  general 
the  transmissions  of  one  terminal  can  be  heard  by 
many  other  terminals.  The  routing  will  specify 
which  terminal  is  to  repeat  the  packet,  if  ne¬ 
cessary  . 

If  two  or  more  transmissions  are  simulta¬ 
neously  heard  by  a  temmal  (called  a  "collision") 
at  least  one,  and  possibly  both,  is  "lost"  and 
must  be  retransmitted.  We  assume  retransmis¬ 
sions  are  scheduled  at  a  random  instant  in  time 
sufficiently  far  in  the  future  so  as  to  preserve 
the  Poisson  nature  of  the  combined  scheduled 
traffic  stream,  which  now  consists  of  exogenous 
traffic  and  rescheduled  traffic.  For  this  study, 
we  assume  that  a  packet  can  be  retransmitted  as 
many  times  as  is  necessary,  i.e.,  that  there  is  no 
maximum  allowable  number  of  retransmissions. 

At  any  time,  terminals  may  either  transmit  or 
receive,  they  cannot  do  both  simultaneously 
Before  transmitting,  a  terminal  senses  the  chan¬ 
nel.  If  it  detects  that  any  of  its  neighbors  (i.e  , 
terminals  that  it  can  hear)  are  transmitting  (by, 
eg.,  sensing  the  carrier)  it  reschedules  the 
transmission  as  for  collided  packets  above.  If  at 
the  scheduled  time  for  a  transmission,  the  termi¬ 
nal  is  already  engaged  in  transmitting  a  packet, 
the  new  packet  is  also  rescheduled  as  above 
Thus  packets  are  continually  rescheduled  until 
they  are  successfully  delivered  to  the  next  termi¬ 
nal  on  their  route.  We  assume  that  the  total 
stream  of  traffic  scheduled  by  any  terminal  is  a 
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Poisson  process  This  includes  oriqirja'ir.tj  traffic 
and  packets  rescheduled  either  due  to  collisions 
or  due  to  the  channel  having  been  sensed  busy 
This  scheme  is  called  carrier  sense  multiple  access 
(CSMA}.  This  Poisson  assumption  is  valid  for  the 
assumptions  made  above  and  will  yield  accurate 
results  for  throughput. 

It  is  possible,  due  to  non-zero  propagation 
delay,  that  collisions  ol  transmissions  from  neigh¬ 
boring  terminals  may  still  take  place  despite  the 
CSMA  strategy.  This  will  occur  if  a  terminal 
senses  the  channel  before  another  terminal's 
transmission  is  received  This  effect  is  small  if 
terminals  are  reasonably  close  or  are  not  trans 
mitting  at  high  speed.  We  will  ignore  this  phe¬ 
nomena  here,  and  assume  that  all  transmissions 
are  instantly  heard  by  their  neighbors. 

A  passive  acknowledgement  is  used  for 
transmission  to  neighbors.  The  transmitting 
terminal  listens  to  the  channel  to  hear  if  a  packet 
is  being  rebroadcast  by  a  neighbor.  If  after  a 
prespecified  time  interval,  the  transmitting  node 
does  not  hear  the  packet  rebroadcast,  it  retrans¬ 
mits  the  packet.  But  the  packet  may  have  been 
successfully  received  by  the  neighbor  even 
though  the  originator  does  not  hear  the  rebroad¬ 
cast.  Duplicate  packets  may  be  transmitted  and 
deleted  only  at  the  final  destination  or  they  may 
be  detected  and  deleted  earlier.  An  end-to-end 
acknowledgment  is  returned  to  the  originator  from 
the  final  destination.  In  our  original  work  we 
assumed  that  passive  acknowledgments  were 
always  heard  and  ignored  the  effect  of  end-to-end 
acknowledgments.  The  end-to-end  acknowledg¬ 
ments  could  easily  be  added  to  the  required 
traffic.  In  this  paper  we  consider  the  situation 
when  passive  acknowledgments  may  not  always  be 
heard.  We  also  introduce  and  analyze  a  new 
acknowledgment  protocol. 

We  depict  the  topology  of  the  network  by  a 
graph  where  terminals  are  represented  by  nodes. 
The  nodes  are  connected  by  a  link  if  they  can 
hear  each  other's  transmissions,  i.e.,  if  they  are 
neighbors.  As  an  example  see  Figure  1.  Node  A 


ABC 


Figure  1 

Network  Topology 


can  hear  node  B,  but  not  node  C.  Node  B  can 
hear  both  nodes  A  and  C.  Node  C  can  hear  node 
B,  but  not  node  A.  If  node  A  is  transmitting  to 
node  B  and  node  C  begins  transmitting,  then  the 
transmission  from  A  to  B  may  be  lost  depending 
upon  the  "capture"  assumptions  we  make  A 
conservative  assumption  is  that  the  A  to  B  trans¬ 
mission  is  lost  --  this  is  known  as  zero  capture. 
Alternatively,  perfect  capture  assumes  that  tins 
transmission  is  successfully  received.  Half- 
amplitude  capture  assumes  that  the  transmission  is 
lost  if  C  dominates  A  at  B  I  his  can  happen  il  O 
is  closer  to  B  than  A  is  to  B.  or  has  a  grener 
signal  strength  perceived  at  B  than  A  has  II  A 


dominates  C,  then  the  transmission  is  succ»ss(ul 
However,  in  all  cases  of  a  collision  we  assume  the 
later  transmission  is  lost.  Thus  if  C  is  trans¬ 
mitting  to  B.  this  packet  is  iost  in  all  cases.  We 
will  consider  only  perfect  capture  situations 
below.  Note  that  under  CSMA  if  node  B  is  trans¬ 
mitting  neither  A  or  C  is  allowed  to  transmit. 

We  assume  that  a  routing  has  been  specified 
This  takes  the  form  of  deciding  which  ol  the 
neighbors  are  to  rebroadcast  a  packet  from  a 
particular  source  to  a  particular  destination 
Thus  the  amount  of  traffic  that  a  terminal  wishes 
to  send  to  its  neighbor  can  be  computed.  If 
these  rebroadcast  packets  are  scheduled  at  a 
random  time  far  in  the  future  the  Poisson  as¬ 
sumption  for  scheduled  traffic  streams  is  pre¬ 
served.  We  assume  that  the  packet  length  is 
reassigned  independently  at  each  hop  Although 
we  have  extended  the  analysis  to  include  non- 
exponential  packet  length  distributions,  we 

consider  only  exponential  packet  lengths  here. 
The  results  are  similar. 

The  details  of  CSMA  for  a  single  hop  net¬ 
work  can  be  found  in  the  papers  by  Tobagi  and 

Kleinrock1.  Tobagi  has  also  developed  some 

simple  models  for  two-hop  networks2 .  Details  of 
a  packet  radio  network  can  be  found  in  a  paper 

by  Kahn3.  Details  of  our  analytic  procedure  were 

presented  at  ICC'804.  More  detail*  can  be  found 

in  the  thesis  of  Sahins. 


Analytic  Procedure 


In  this  section  we  review  the  analytic  pro¬ 
cedure  for  perfect  acknowledgments4.  Let  i  be  a 

* 

node,  n;  one  of  its  neighbors,  the  set  of  all 
the  neighbors  of  i,  and  N;  the  set  of  all  i's 
neighbors,  including  i.  Let  g.  be  the  total  rate 

(in  packets/sec)  of  all  scheduled  traffic  at  node 
i.  This  includes  originating  traffic  and  all  re¬ 
scheduled  traffic  and  is  assumed  to  be  Poisson . 
Let  1/p.  be  the  average  length  of  packets  trans¬ 
mitted  by  node  i.  Let  =  g./p^  be  a  normalized 
rate. 

Node  i  is  either  busy  ( transmitting)  or  idle. 
It  will  transmit  a  scheduled  packet  if  at  the  in¬ 
stant  it  is  scheduled  all  nodes  in  N.  are  idle.  Let 

A  be  a  set  of  nodes.  Let  P(A)  be  the  probability 
that  at  a  random  instant  all  nodes  in  A  are  idle 
The  nodes  not  in  A  may  or  may  not  be  idle 
Similarly  P(i),  P(i,A),  P(A,B)  are  the  proba 

bilties  that  node  i  is  idle,  node  i  and  nodes  in  A 
are  idle,  and  all  nodes  in  A  and  B  are  idle. 

A  packet  from  i  to  n^  will  be  transmitted 

when  it  is  scheduled  if  N  is  idle.  During  the 

*  * 

transmission,  because  of  CSMA,  ail  nodes  in  N 
will  be  idle  It  will  be  successfully  received  at  n 
if  all  neighbors  of  n.  not  in  Nj  are  also  idle  at 
the  beginning  of  the  transmission.  Otherwise 


a  collision  will  occur.  Let  s 


I  hu-. 


e 


i ,  n 


be  the  rate  in 


packets/sr,  determined  by  the  routing  ot  the 
traffic  that  i  wishes  to  send  to  This  is  the 

required  throughput  or  olfered  tratlic.  Let  g 


be  the  rate  of  all  scheduled  traffic  from  i  to  n 

t 

We  have  assumed  that  all  these  streams  are  Pois¬ 
son  and  independent.  Of  these  g  packets 


per  second,  s, 


must  be  successful.  Thus 


‘•ni 

'  -P(Nj.N  ) 


l<n; 


i. 


where  S-  =  s-  / p;  and  G-  _ 

t ,n  .  i  ,n  j  i  t , n  j 


=  ai.ni/Mi- 


O) 


when  SI'  refers  to  sum  ol  products 

t'(  A  )  SLi.  a‘  )  .1  )  .  i,  i 

The  above  equation.  an  used  b:  in, 

topology  to  gener  ate  the  solution  to  our  pt  a. 

The  equations  relating  the  s  ,  G  and  <i 

t .  n  t  1  • n  ,  I 

can  be  solved  iteratively  Lor  example,  equation 
(1)  now  becomes 


SP(|N1  a  Nn 

s  pTnT  " 


(7) 


We  have  also  given  many  rules  to  make  the  writ 
ing  of  these  equations  easier4. 

IV.  Passive  Acknowledgments 


The  total  scheduled  traffic  (normalized)  at  a 
node  is  given  by 


Gi=  *  ,G|,a  <2> 

tij  t  Nj  1 

If  the  packet  lengths  are  exponentially 
distributed,  then  the  system  can  be  viewed  as  a 
Markov  process  where  the  states  are  identified  by 
which  nodes  are  idle  and  which  are  busy.  Let  D 
be  a  set  of  busy  nodes.  Because  of  CSMA,  no 
nodes  in  D  may  be  neighbors  of  each  other.  Let 
Q(D)  be  the  probability  that  at  an  instant  of 
time,  all  nodes  in  D  are  busy,  and  all  nodes  not 
in  D  are  idle.  Then  each  set,  D,  represents  a 
state  in  a  Markov  system,  and  Q(D)  is  the  state 
probability.  In  particular,  the  null  set  D  =  $, 
represents  the  state  that  all  nodes  are  idle. 

We  have  shown4  that 

n  Gj 

Q(D)  =  j§£-  (3) 

DcN  jeD 

where  N  is  the  set  of  all  nodes  and  we  consider 
n  G.  =  1. 
id 

We  are  interested  in  quantities  like  P(A), 
where  A  is  any  set  of  nodes,  and  P(A)  is  the 
probability  that  all  nodes  in  A  are  idle,  and  all 
nodes  not  in  A  may  nor  may  not  be  idle.  This 
can  be  found  by  summing  Q(D)  over  all  sets  D 
that  do  not  contain  nodes  in  A.  Thus 


P(  A)  =  l  Q(D)  = 
DcAc 


i  n  Gi 

DcAc  icD 

1  n  <T 

DcN  ieD 


(4) 


where  DcA  refers  to  all  such  sets  contained  in 
the  complement  of  A.  We  adopt  the  shorthand 
notation 

SP( B )  =  l  (  n  G.)  (5) 

DcB  icD  1 


In  this  section  we  extend  our  previous  work 
on  throughput  in  multihop  packet  radio  networks 
to  include  the  effects  of  passive  acknowledgments 
not  always  being  heard.  Let  node  i  be  a  neigh¬ 
bor  of  node  i.  we  have  already  seen  that 

Sj  j  =  Gj  jP(NjINi)P(Ni)  =  Gj<jP(N..Nj)  (8) 

When  j  transmits  the  packet  to  the  next  node 
in  the  route,  i  can  hear  the  transmission,  and 
thus  receive  a  passive  acknowledgment,  it  all  ol 
its  neighbors  (except  j)  are  idle.  If  it  does  rot 
hear  the  transmission,  then  after  a  suitable 
time-out  period  it  retransmits  the  packet.  Pre¬ 
viously  we  assumed  all  passive  acknowledgments 
were  heard.  Here  we  see  that  a  passive  ack¬ 
nowledgment  is  heard  with  probability  P(NJNj). 

We  assume  that  the  time-out  period  is  selected  so 
that  all  nodes  have  the  opportunity  of  hearing 
exactly  one  transmission.  If  they  fail  to  hear  it, 
the  packet  is  retransmitted.  This  is  repeated 
until  an  acknowledgment  is  heard.  The  assump¬ 
tion  that  exactly  one  transmission  may  be  heard 
in  a  time-out  period  is  equivalent  to  assuming 
that  the  same  time  out  period  is  used  for  all 
nodes  and  that  it  is  long  enough  to  insure  one 
transmission , 

The  rate  Sj  ^  in  equation  (8)  includes  re¬ 
peated  packets,  because  of  failure  to  hear  ack¬ 
nowledgments.  Let  S*  j  be  the  rate  of  packets 

for  which  passive  acknowledgments  are  heard. 
Then 


su  =  qi.,  SU 

<9) 

P(  N  .N  ) 

where 

=  P(N.|Nl)  = 

pTnJ) 

HO) 

We  have  already  show 

n  how  to 

solve  sets  of 

equations 

like  (8)  for  the 

G  as 

i .  | 

a  function  ot 

the  s,.r 

and  found  the 

maximum 

possible 

which  we 

defined  as  the 

throuqhpi 

It4  Put  t  > 

ample,  in 

a  chain,  all  S 

=  S  fur  | 

-  i  and  i  -  1 
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Here  we  see  how  to  modify  this  procedure  to 
account  for  passive  acknowledgments .  The  S* 

are  determined  from  the  offered  traffic  and  the 
routing . 

We  identity  two  situations  Packets  are 
retransmitted  il  they  ate  unsuccessfully  trans¬ 
mitted  or  if  the  passive  acknowledgment  is  not 
heard.  In  the  latter  case,  a  duplicate  packet  is 
sent.  If  the  receiving  node  recognizes  the  dupli¬ 
cate  packet  and  deletes  it,  then  S*  ^  is  the  de¬ 
sired  rate  of  traffic  between  1  and  |  If  a  simpler 
node  does  not  have  this  capability,  then  duplicate 
packets  are  retransmitted,  and  after  several  hops 
many  duplicates  exist  In  this  latter  case,  S*  ■ 

includes  these  duplicate  packets  and  must  be 
related  to  the  desired  rate  of  traffic.  We  will 
first  consider  a  chain. 

Consider  the  chain  shown  in  figure  2  con¬ 
sisting  of  n  nodes.  Assume  that  duplicate  pack¬ 
ets  are  not  detected  We  assume  that  node  1  and 
n  wish  to  send  S  packets  per  second  to  each 
other.  All  other  nodes  are  just  repeaters.  When 
node  2  relays  a  packet  to  node  2,  node  1  must  be 
idle,  and  all  passive  acknowledgments  are  always 
heard.  Thus  q1  2  =  1  and' 


S*,2 


=  Sl,2  "  S 


(11) 


At  the  other  end,  node  n  does  not  repeat  the 
packets,  so  passive  acknowledgments  are  not 
possible.  However  every  transmission  must  be 
successful  since  there  are  no  interfering  nodes. 
Thus 


Sn-l,n 


n-l,n 


(12) 


We  let  Cj2  -  I)  i ,  G  2  |  *  Gj  j  —  C*2,  and  >  “  by 

symmetry  that  G3  =  G2  G«  -  G|.  and  P(l,2  i 
P(2,3.4)  from  our  previous  work  we  have 


P( 1 , 2 , 3 )  = 


1*G« 

D~ 


1*G, 

D~ 


P(1 ,2,3,4)  =  k 


(16) 


D  -  l*2G,t2G2*2G1G2*G}  =  ( 1  »G,  )2*2G2(  1*G, ) 


Thus  we  get 

G2  =  G,  ♦  G,(UG,)  =  G j (2*G, ) 

Gi  c, 

S  =  1»G,+2G2  =  (l»Gl)+2G1(2+G"i )  (17; 

We  can  solve  equation  (17)  for  the  maximum  value 
of  S,  the  throughput.  We  obtain  Smax  =  .128 

when  Gi  -  .71. 

To  include  the  effects  of  passive  acknow¬ 
ledgments  we  use  equations  (9  to  13)  to  obtain 
for  arbitrary  length  chains, 


S]2  =  S 


S: 


i,i*l 


_ S _ 

q2,3q3.4'  ’  qi,i*l 


i=2 . n-2 


(18) 


Node  i  attempts  to  transmit  all  S.  ,  ■  sue- 

1*1,1 

cessful  packets  to  i+1.  Thus 


Si*,i+1  =  Si-l,i 


i  =  2 .  n-1  (13) 

Similarly,  in  the  reverse  direction,  we  have 


Sn,n-1  =  Sn,n-1  '  S 

S2,l  =S2,1 

and  sNi,. r  s,,.-i  ■ 


(14) 


2 . n-1 


Sn-l,n  =  Sn-2,n-l 
Also, 

Si2*  -  S2  3*  “  S 

S  ,  _  _ s_ _  (19) 

i,i+l  q2  3 -  ^i-ij 

Returning  to  the  example  for  n=4  we  use 
equations  (18)  and  (10)  in  (8)  to  obtain 

S  =  G12P(.l  ,2,3)  =  q23G23P(l,2,3,4) 


Equations  (11  to  14)  relate  S|  .  and  .*  to  the 

desired  traffic,  S.  Note,  by  symmetry,  we  obtain 
relations  like  equations  (141  by  replacing  i  and  | 
by  n - 1 1  and  n-|»l,  i  espec  tively 

As  an  example  we  evaluate  the  throughout  of 
a  four  node  chain  lust  we  assume  perfect 
reception  of  acknowledgments  and  use  only  equa¬ 
tion  (8)  w.th  ail  -S  Then 


=  ^23^2  1  P(  1 .2,3) 

=  G ,  2 P ( 1 ,2,3)  =  G23  =  G21P1 ,2,3,4) 

(20) 

or  C2  =  G, ( 1  *Gj  )*G,  ( l+G,  )2  =  G,(ltG,  )(2*G, ) 
Thus 

G,  G, 

=  ( r»c  772G  ;rr*G7  *c  , )  l2n 


?  =  g12P(i.:,3) 


G2,P(1 .2,3,4) 


G-2 ,  P(  1 , 2, 3) 


s 


The  throughput  here  is  S  -  098  when 

m.ix 

G,  =  37.  a  reduction  ol  Tif, 

In  Table  1  we  comp.it  e  the  et(e,  is  of  p.issive 
ac  knowledgment  '  tor  vjt  wus  length,  rh.iina  We 
see  th  it  as  the  chain  tsei  nines  ;.,rig  I'e  through¬ 
put  vanishes  Also  shown  :n  that  t.ihle  is 

Table  1  I'.ffect  ol  Passive  Acknowledgments 


Length 
of  Chain 

Perfect 
Acknowledq . 

Throughput 

Duplicates 

Detected 

Duplicates 
Not  Detected 

4 

.128 

'  10  6 

.  098 

5 

111 

.083 

069 

6 

.102 

.072 

053 

7 

.097 

.066 

044 

8 

.094 

.063 

.038 

9 

.092 

.061 

.034 

10 

.091 

.060 

.031 

0> 

.086 

.057 

0 

ments  are  m'luded  anil  dupluate  packets  are 
detected  and  not  transmit  ted  lurthei 

We  next  Illaly/e  the  ellert  ol  pa.. .live  n  n- 
nowledqments  on  throughput  in  star  networks 
We  will  cunsidet  two  topologies  -  one  where  all 
legs  are  unconnected  In.m  each  othei  the  ■■•econd 
where  the  nudes  .  to  ,e>t  to  the  centet  ate  lull, 
connected  We  have  studied  these  networks  in 
our  previous  work  and  assumed  there  that  all 

passive  acknowledgments  were  heard  *  We  assume 
further  here  that  duplicate  packets  are  de¬ 
tected  and  not  retransmitted  See  figure  2 
for  the  configuration  of  (he  star  network  with 
zero  connectivity,  [  legs,  and  K  nodes  in  each 
leg  for  full  connectivity  the  nodes  0,1,  KM, 
2K*T . (L-l  )K*1  are  fully  connected. 

The  equations  used  in  the  previous  section 
are  still  valid  here.  When  duplicate  packets  are 
detected,  all  S.  *  =  S,  the  throughput  in  each 

leg.  The  end  conditions  are 


the  throughput  for  chains  when  duplicate  packets 
are  detected  and  not  transmitted  to  the  next 
node.  We  analyze  this  case  now 

If  duplicate  packets  are  detected  and  not 
transmitted  further,  then  if  *  is  just  the  desired 

traffic  to  be  sent  from  i  to  j.  As  before  we  find 
S-  ■  *  from  the  offered  traffic  and  the  routing. 

For  example,  in  a  chain  S*  ^  =  S*^  (  =  S  for 

i  =l,...,n-l.  Thus  equations  (8  to  10)  combine 
to  give 


S.  *  =  G.  . 
1.)  i.J 


[PC^.Nj]2 

WT) 


(22) 


except  at  terminal  nodes  where  no  further  trans¬ 
mission  is  necessary  and  -  1.  This  can  be 

solved  as  before  to  yield  the  G.  .  as  a  function  of 
the  S-  j*  and  then  the  maximum  S[  *  or  through¬ 
put  can  be  found. 

We  present  the  analysis  for  the  four  node 
chain  analyzed  above.  Here  Sj  (  =  S 

for  i  -  1,2,3.  Again  we  use  symmetry  to  obtain 


S  =  G, 


1 


0  ~  023  (>G,)I) 

D  =  (1+G,)2  ♦  2G2(1*G,) 
as  before. 


=  G, 


1+G, 


(23) 


But  G2  ~  Gt  ♦  G ( ( 1 +G !  )2 ,  so 
C, 

S  =  rr-G7)*lGjlVrT7;;;^T  <24) 

Solving  we  get  S  =  106  when  G,  =  42 

The  throughput  tor  various  lengtn 
chains  are  given  in  Table  1 .  We  see  here  that 
there  is  a  34°?,  reduction  in  throughput  tor  long 
chains  when  the  effects  ol  passive  acknowledg¬ 


q£K*L- 1 , (2»1  )K  '  1  '  £  0 . L_1  (25^ 

since  transmissions  to  the  end  nodes  are  always 
heard  and  passive  acknowledgments  are  not  sent. 
This  can  be  found  from  equation  (10).  We  fur¬ 
ther  assume  that 

IjKH.O  =  1  •  4  =  0 . <26> 

Transmissions  to  the  center  node  are  not  always 
heard.  Retransmissions  due  to  collisions  are 
included  in  our  analyst:.  However  since  the 


Figure  2 

A  star  network  with  L  legs  and  zero  connectivity. 


387 


renter  node  does  not  r"'r  ansmi!  passive  ack¬ 
nowledgments  rit  e  not  avail  able  Wr  .issame  .1 
nerfert  end-to-end  1  kn<  wt.ggmr”!  cper-ding 
Equations  H  to  ]g  nave  hem  avert  with 
perfect  passive  arknowle  'green's  md  with  imper- 
fert  passive  acknowledgments  h-.i  d.,:dn  oe  narket 
detection  fwn  topologies  hi\e  teen  durtied  - 
zero  connectivity  and  full  ee-nne,  tsv  ;ty  at  >and  the 
center  The  resultant  maximum  throughputs  pet 
leg  are  shown  in  Tables  2  and  1  the  maximum 
thioughput  Irom  the  center  node  n  given  in 
Table  4.  We  see  trom  these  results  that  there  is 
again  a  34°t,  reduction  in  tnroughput  lor  long  legs 
and  zero  connectivity  For  tuli  connectivity  we 
find  a  43  to  46“,  reduction  :n  throughout  foe  to 
the  failure  to  hear  passive  acknowledgments 
from  our  previous  work  round  that 

except  for  short  legs,  the  fully  connected  topol¬ 
ogy  is  better  when  passive  acknowledgments  are 
not  considered.  This  is  because  inward  trans¬ 
missions  from  neighbors  of  the  center  are  not 
collided  with  With  zero  connectivity  passive 
acknowledgmrn ts  for  transmissions  from  the  fir  , 
ring  to  the  second  are  interfered  with  .my  r. 
the  center  node  With  full  connectivity  1"  oth-r 
first  ring  nodes  add  to  this  interterence  This 
the  first  ring  will  have  to  retransmit  more  In 

Table  4  we  compare  the  maximum  throughputs 
from  the  center  node  for  full  and  zero  connecti¬ 
vity  and  see  that  indeed  zero  connectivity  is 
better. 

Table  2:  Effects  of  Passive  Acknowledg¬ 

ments  on  Star  Networks  Zero 
Connectivity  (Number  of  Legs, 

L-5)  -  Duplicates  Detected 

Maximum  throughput  per  Leg,  S 

Number  of - — 

Hops  Perfect  Imperfect 

per  Leg,  K  Acknowledgments  Acknowledgments 


1 

.058 

.058 

2 

.045 

.033 

3 

.044 

.0290 

4 

.044 

.0286 

6 

.044 

.0286 

Table  3:  Effects  of  Passive  Acknowledg¬ 

ments  on  Star  Networks. 

Pull  Connectivity  - 
Duplicates  Detected 

Maximum  throughput 
per  Leg, 


Number  of 

Hops 

Perfect 

Imperfect 

Legs ,  L. 

per  leg,  K 

A.  knowledg 

A.  kr.owledg 

6 

3 

04 

0213 

4 

04 

0230 

>5 

04 

0230 

9 

3 

.  0286 

0155 

4 

0380 

0154 

>5 

C2;!0 

0.54 

I  able  4  1. fleet  of  Pa  ,sive  Acknowledgment;,  on 

Star  Networks 


Maximum 

throughput  Irom  the  u-nh 

Maximum  through 

per  Leg  .  la. 

No.  cl 

No 

ol 

Perfect  Irr  per  '• 

Legs , 

Hops 

per 

nnec- 

Arknow-  Ackrtu. 

L  _ 

leg , 

K 

’  ivity 

l»dgments  ledgme 

5 

2 

Zero 

225  165 

>3 

.220  145 

6 

3 

'•1 

Pull 

240  .140 

240  138 

9 

3 

full 

257  140 

>4 

.252  .139 

V  _A_New  Protocol  for 
Passive  AcTTnowledgemen  ts 

In  the  previous  section  we  discussed  the 
eftect  of  imperfect  acknowledgments  on  the  maxi¬ 
mum  obtain, il.le  throughput  lor  various  topologies 
We  saw  that  the  maximum  obtainable  throughput 
was  reduced  more  than  30  percent  for  a  chain, 
and  more  than  44  percent  for  the  star  network 
The  protocol  that  we  used  has  the  following 
disadvantages ; 

1.  Each  node  has  only  one  chance  to  hear 
acknowledgments  Irom  its  neighbors. 

2.  Since  the  probability  of  a  node  hearing 
an  acknowledgment  from  its  neighbors  if. 
significantly  less  than  one,  some  ol  the 
successful  t'ansmissions  are  duplicate 
transmissions . 

3.  The  next  node  must  recognize  duplicate 
transmissions ,  otherwise  the  actual 
throughput  will  go  to  zero  when  the 
number  of  hops  between  source  and 
destination  nodes  is  large. 

4.  Since  each  node  must  wait  until  trans¬ 
mission  of  the  packet  by  its  neighbor  ’ 1 
hear  acknowledgments  fur  t h <> t  packet, 
the  timeout  peripd  is  large  Thus, 
delay  w,ll  be  large  too. 

We  propose  the  following  protocol.  A  node 
in  general  has  many  packets  to  transmit,  in 
addition  to  repeating  a  packet  mst  received 
These  packets  are  intended  for  ill  its  neighbors 
The  old  protocol  requires  one  to  wait  until  that 
particular  packet  is  repeated  Here  we  enlarge 
the  header  of  every  pa;  ket  so  'hat  ;t  int  .u«ie<- 
acknowledgment  in  tot  motion  fur  ail  recently  re¬ 
ceived  packets  I  has  a  "p  issive"  acknowledg¬ 
ment  can  be  received  on  the  ilex'  packet  '1  r  - 
milted.  The  acknowledgment  fur  .1  puts  ui. 11 
packet  can  lie  included  in  the  next  several  ;  uk- 
ets.  so  there  are  several  attempts  to  neat  the 
acknowledgment  At  the  expen;  e  nl  i n  1  r  *  1 -eg 
header  and  rove  teg..  in  the  node-,  ttv  eti.it  : 
passive  acknowledgments  ran  be  made  ruqngiMe 


% 


o 


lhe  protocol  l h.i  1  we  hove  developed  lo 
overcome  these  problems  is  th  it  .1  n.nie  recognizes 
0  passive  acknowledgment  lot  it*  pa*  Kid  t>y  lhe 
virtual  transmission  ot  tnat  picket  A  packet  is 
virtually  transmitted  it  its  identifier  is  transmitted 
with  another  packet  loch  node  when  trans¬ 
mitting  a  packet  includes  not  only  the  packet's 
identifier  but  also  the  identifiers  of  previously 
received  packets  A  packet's  identifier  is  con¬ 
tained  in  tne  next  m  packets  A  rode  w  juld  then 
have  m  opportunities  to  hear  it  its  transmission 
was  received  (we  Jo  not  count  the  actual  trans¬ 
mission).  If  a  packet’s  identifier  is  transmitted  m 
times  then  the  acknowledgment  for  that  packet  Is 
almost  always  heard  lor  a  sufficiently  large  value 
of  m  If  a  packet's  transmission  is  not  received, 
then  after  a  timeout  period  that  packet  is  re¬ 
scheduled  for  transmission 

VI.  The  Effect  of  the  New  Protocol 
on  ThrougFput 


Inserting  n  identifiers  Into  a  parket  will 
increase  a  packet  length  hy  a  tarter  (see  fig.  3) 


h  »d  +(n ♦  1 ) _b 
Fi  *  a  ♦  li 


(27) 


The  value  of  n  is  related  to  m  as  will  be  shown 
later  Here  n  is  the  number  of  ditferent  identi¬ 
fiers  added  to  the  packet.  for  the  moment  we 
will  assume  that  the  average  value  of  n  e.iuuS  11 

The  value  of  m  is  chosen  such  that  the 
probability  of  hearing  an  acknowledgment,  q.  is 

close  to  one.  Since  every  node  hears  acknowl¬ 
edgments  from  its  neighbors  with  probability  close 
to  one  there  are  almost  no  duplicate  transmis¬ 
sions.  Thus  nodes  do  not  have  to  recognize 
duplicate  transmissions  for  the  sake  of  through¬ 
put. 

If  each  node  virtually  transmits  a  packet  m 
times  then  the  probability  of  successfully  hearing 
an  acknowledgment  can  be  recomputed  as  follows: 


We  develop  and  analyze  a  new  acknowledg¬ 
ment  protocol  which  we  refer  to  as  the  virtual 
acknowledgment  protocol.  In  this  protocol,  il 
node  j  hears  a  packet  transmitted  to  it  by  node  i, 
node  j  includes  the  identifier  of  this  packet  in  the 
next  m  packets  it  transmits  to  any  node  (in 

addition,  of  course,  to  later  transmitting  the 
packet  itself).  We  refer  to  this  as  m  virtual 
transmissions  (and  one  actual  transmission)  of  the 
packet.  Node  1  then  has  m  opportunities,  rather 
than  one,  to  hear  thar  node  )  received  its  packet. 
Thus  the  probability  of  an  acknowledgment  being 
heard  is  greatly  increased  over  what  it  was  with 
passive  acknowledgments.  This  increases 
throughput  significantly ,  roughly  50%  for  the 
topologies  we  have  studied. 

In  addition  to  this,  node  1  can  make  the 

determination  as  to  whether  or  not  to  transmit  as 
soon  as  it  hears  any  packet  transmitted  by  node 
j.  Thus,  node  i  does  not  have  to  wait  for  its 
packet  to  be  scheduled  for  retransmission  and 
retransmitted  by  node  j.  This  reduces  delay 
significantly 

We  present  the  analysis  of  throughput  and 
overhead  due  to  the  increased  length  of  the 
header  and  give  a  general  procedure  for  com¬ 

puting  these  quantities.  We  also  present  analyses 
of  several  specific  topologies  of  interest  and 

compare  the  performance  of  this  protocol  with  that 
of  passive  acknowledgments  for  these  topologies. 


h  bits  b  bits  d  bits 


Header 

ID 

Data 

1 

2 

(n*l) 

Header 

ID 

ID  . 

. ..  ID 

Data 

h  bits  (n-*T)b  bits  d  bits 

Figure  3 

Packet  Formats  for  the  Old  and  New  Protocols 


Let  q^  be  the  probability  of  hearing  an  ack¬ 
nowledgment.  Then  the  probability  of  not  hearing 
an  acknowledgment  after  m  virtual  transmissions  is 

q*  =  1  -  (l-qjj)"1  (23) 

* 

For  large  values  of  m,  q.  goes  to  1.  When 
*  U 

cjj j  =  1  the  throughput  (in  bits/sec)  will  increase 

to  the  results  for  perfect  capture. 

* 

q(j  is  substituted  for  in  all  throuqhput 

equations  for  chain  and  star  networks  which  we 
presented  in  the  previous  sections.  The  maximum 
obtainable  throughput  for  different  topologies  are 
given  in  Tables  5  and  6  The  probability  of 


Table  5. 

Effect 
for  the 

of 

chain 

imperfect 

network 

acknowledgments  on 

%  -  \  =_  s_).  ____ 

the 

throughput 

as  a 

function  of 

Throughput 

of  a  chain 

Length 

number  of  virtual  transmissions,  m 

Perfect 

of  Chain 

Acknowl¬ 

<N) 

1 

2 

3 

4 

5 

6 

7 

8 

edgment 

•4 

.  10t» 

.119 

.  12-1 

127 

.127 

.1275 

.1276 

128 

128 

5 

.083 

.1 

.  106 

.  108 

.11 

.1105 

1108 

1109 

111 

6 

.072 

.09 

096 

1 

.  101 

T  )2 

102 

102 

102 

7 

06b 

.085 

.091 

095 

.096 

*>97 

097 

097 

0*17 

8 

063 

081 

.088 

0’H 

.093 

094 

.094 

094 

.  094 

10 

.06 

.078 

085 

088 

09 

.09 

.0903 

.0905 

091 

i>w 


% 


fable  6.  Effect  of  imperfect  ai  kn.wledgmenis  un 
the  throughput  us  ,  turn  turn  -.t  m  !.  i 
five  legs  in  a  st.tr  network  w,th  .'.to 
connectivity  (5  =  s  =  S) 


Table  7  Probability  ol  hearing  arknnwlprfumen! 

ig  )  .is  a  function  of  m  for  live-leg:, 

of  length  'j  in  a  st.u  network  with 
zero  connectivity 


Throughput  of  a  star 
with  five  legs 
number  ol  vir  tual 
transmissions,  m 


Length 
of  Legs 
(K)  1 

2 

3 

4 

5 

6 

Perfect 

Acknow¬ 

ledgment 

1 

.058 

.058 

058 

.  058 

2 

.033 

.039 

.042 

.043 

044 

0445 

045 

3 

.029 

.037 

.04 

042 

.043 

0433 

044 

>  4 

.029 

.037 

.04 

.0415 

.042 

.04.3 

(44 

hearing  acknowledgments  is  also  given  in  Table  7 

We  see  leu  Table  7  that  rne  probability  ol 
hearing  acknowledgments  approaches  1  when 
m  =  5  for  zero  connectivity.  ['he  throughput 
almost  increases  to  the  values  of  perfect  acknowl¬ 
edgment  for  m  =  5. 

We  compared  the  two  protocols  for  acknowl¬ 
edgments  for  chain  and  star  networks.  The 
throughput  of  the  center  node  increased  by  SOf, 
for  a  ten  node  chain,  by  48“*  for  zero  connec¬ 
tivity,  and  by  more  than  50^  for  full  connec¬ 
tivity. 

As  we  mentioned  before  the  length  of  a 
packet  is  increased  by  a  factor 

f  -  h  ♦  d  ♦  (m+1)  •  b 

1  *  ™  h  ♦  d  ♦  b 

because  of  the  virtual  transmissions  We  inserted 
m  •  b  extra  bits  into  a  packet  Thus,  the 
throughput  is  given  by 

Sij  1  G,j  •  P(N.'  V  •  %  ■  H-t  d  * 

(29) 


number  of  virtual  ti ansmissions .  m 


V. 

1 

2 

3 

4 

5 

6 

7 

do  i 

530 

.  700 

.810 

86 

91 

94 

95 

di; 

.670 

890 

.  960 

990 

997 

.999 

1 

d  2  3 

.890 

.980 

.990 

.999 

1 

d.34 

910 

990 

.994 

.999 

1 

1 

1 

1 

1 

dio 

.910 

.990 

.  999 

1 

d  2  1 

.930 

.990 

999 

1 

d  32 

.940 

990 

999 

1 

d  4  3 

.970 

.999 

1 

dS4 

1 

Vll .  Optimum  m  for  the  New  Protocol 

fn  the  previous  section  we  have  studied  Ihe 
eitect  of  the  new  protocol  on  throughput  We 
found  that  tor  ni  >  4  the  star  network  with  full 
connectivity  retains  its  superiority  with  respect  to 
the  other  connectivities.  In  this  section  we  will 
more  closely  study  the  L-leg  star  network  with 
full  connectivity  , 


and  hence  throughput.  There  is,  however,  a 
tradeoff  between  the  throughput  and  overhead 
(due  to  the  increase  in  the  packet's  length)  due 
to  virtual  transmissions.  For  several  cases  we 
find  the  optimum  m.  First  we  assume  that  at  all 
nodes,  if  a  packet's  identifier  is  transmitted  vir¬ 
tually  m  times,  then  we  also  insert  m  identifiers 
of  received  packets  into  the  header  of  a  packet 
which  is  ready  for  transmission.  The  next  m 
transmissions,  after  receipt  of  the  packet  in 
question  will  include  the  identifier  of  that  packet 
For  this  case  the  throughput  is  given  by 


* 


h  ♦  d  ♦  b  _ 
h  +  d  +  (m+1)  •  b 


If  we  choose  b  =  12,  h  =  84.  m  r  5,  and  d  =  960 
bits  then 


h i_»  d  ♦  b 
h  +  d  +  (m-*-!) 


=  946 


or 


m  •  b_ 

R  ♦  d  *  b 


057 


This  means  that  the  throughput  decreased  65,  due 
to  increased  header  when  compared  warn  perfect 
acknowledgments  But  we  know  ihat  (he 
throughput  increased  much  more  than  6"„  because 
of  the  new  protocol.  The  results  in  these  tables 
were  obtained  by  using 


S-.  =  G..  •  PfN  ,  N  i 
i)  i)  i  I 


* 


(30) 


In  order  to  incorporate  Eg.  (29).  'he  results 
in  Tables  5  and  6  are  divided  by  Eg.  (27). 


(31) 


where  j  is  the  one  of  i's  neighbors. 

The  results  using  Eq.  (31)  were  computed 
for  the  9-leg-star  network  for  various  values  of  m 
when  k  =  4  The  optimum  m  is  10  for  this  case 
In  Eq.  (27)  we  assume  that  the  increment  in 
a  packet's  length  is  m  •  b  bits.  But  we  know 
that  the  hotspot  (node  0)  and  nodes  at  the  end  of 
the  legs  do  not  have  to  send  passive  acknowledg¬ 
ments  because  the  transmissions  into  the  hotspot 
and  nodes  ai  the  end  of  the  legs  will  never  have 
collisions.  Thus  there  is  no  increment  in  the 
packet's  length  tor  these  nodes.  Equation  (31) 
can  be  modified  to  obtain  exact  results  as  billows 
Again  we  assume  that  node  0  sends  S  un.ts 
of  traffic  in  each  direction .  Then  the  nodes  ih.it 
have  m  •  b  bits  increment  in  their  packet  .  ieng'h 
must  send  I  •  S  units  of  tiaffic  to  their  neigh¬ 
bors  where 


I 


_  h  ♦  d  +  (m+1  )b 


(35) 


-  !)  *  d  ♦  ( m+  1  )_h 
h  ♦  d  ♦  t 


For  left  to  right  transmissions  the  through¬ 
put  equations  are 


S0l  =  S  =  G0l  •  P(N0.  N,) 

S  -  Gi,.+1  •  P<N,'  Ni+1> 


Si.,+1  ’  1 


for  1  z  1-2" 


For  right  to  left  transmissions: 


K-l 

(32) 


Sk,k-1  =  S  =  Gk,k-1 


V  k-l 

Si+l,i  =  1  •  S  =  Gi+l,i  •  P<Ni-  N,+l> 
for  i  =  0,1 . 


Vl.i 


k-2 


We  expect  that  the  results  from  Eq.  (32)  are 
greater  than  the  results  from  Eq ,  (31)  because 
Eq.  (31)  is  solved  assuming  that  the  packet's 
length  is  incremented  by  m  •  b  bits  at  all  nodes. 

The  results  for  Eq .  (32)  were  computed.  We 
found  that  the  optimum  m  is  11  and  the  total 
throughput  of  the  hotspot,  S^,  is  .2222.  When 

m  >  11  the  throughput  decreases  because  of  over¬ 
head  due  to  virtual  transmissions. 

* 

We  know  that  q  .  S  1  when  m  =  4  for  all  i 


’ll 


and  j  except  q12.  This  means  that  nodes 

1,3,4,...,  k-l  need  transmit  a  packet's  identifier 
only  four  times.  Let  us  assume  that  node  2 
transmits  m  times  and  all  the  other  nodes  transmit 
m  times.  Then  the  packet's  length  is  incremented 
by 


1. 


~  m  •  b  bits 
=  m  •  b  bits 


for  i  =  1,3,4, ... ,  k-l 


(33) 


=  0 


10  *  'k  '  '2k  ‘LK 

With  these  assumptions  Eqs.  (32)  become 

for  left  to  right  transmissions 

S 


I' 


So,  ~  Go, 

•  P(N0 

,  N, ) 

S  =  G.  .  . 
i,l+l 

•  P(N.. 

Ni+r 

qi,i+l'  1  r 

1,3,4.. 

. . ,  k 

S  =  G23  • 

P(N2, 

n3)  • 

for  right  to  left  transmissions  we  have:  (34) 

I  •  S  =  G.  ,  •  P(N- ,  N.  .  ) 

i+l,i  i  i  +  l 

•  q*+l  r  >  =  0,1, 2, 3 .  k-l 

r  •  s  =  G2l  •  P( N  j  ,  Nj)  ■  q21 
S  =  S,  ,  .  =  G,  ,  •  P(  N,  .  N,  .  ) 


h  ♦  d  ♦  b 


and 


r 


h  ♦  d 
— FTT 


+  (m'+l)b 
d  ♦  b 


(36) 


For  h  =  84.  b  =  12,  d  =  960,  and  m  =  4  the 
optimum  m'  is  19,  and  the  total  throughput 
of  the  hotspot,  ST,  is  .2390.  With  m  =  4  and 
,  *  1 

m  =  19  all  q.j  ;  1,  therefore,  the  nodes  will  not 

send  any  significant  number  of  duplicate  trans¬ 
missions  to  the  next  hop.  Since  for  node  2, 

m'  =  19,  the  increment  in  a  packet's  length  there 
is  22%  (228  bits).  For  nodes  1  and  3,  the  incre¬ 
ment  in  the  packet's  length  is  4.5%  (48  Dits). 

We  can  further  increase  the  throughput  by 
reducing  overhead  as  follows: 

A  node,  i,  receives  SD,iw  units  of  the 

1x^1/ 

successful  transmission  (throughput)  from  its 
neighbors  where 


R(i) 


(37) 


X.S» 

jeNj 

For  each  unit  of  successful  transmission  node  i 
sends  m.  passive  acknowledgments  so  that  the 
total  number  of  acknowledgments  at  node  i  is 


mi 


JR(i) 


per  unit  of  time.  But  node  i 


transmits  with  a  rate  of  G 


that  G. 


POV  >  SR(i). 


P(Nj)  and  we  know 


Therefore  each  trans¬ 


mission  must  carry  r.  passive  acknowledgments 


where 


-  mi  '  SR(i) 
i  ‘  G7  •  p(n4> 


r 


(38) 


The  overhead  in  a  packet's  length  at  node  i 
is  given  by 


lj  =  r.  •  b  ,  for  i  =  0,1 ,2 .  k. 

If  we  assume  that  the  hotspot  (node  0)  and 
nodes  at  the  end  of  each  path  (leg)  want  to  send 
S  packets/unit  time  then  the  throughput  is  given 
by 


V 


♦  d 


S  =  G- 


P(N. 


V 


Ml 

(39) 


for  i  =  0,1 .  k,  and  where  j  is  a  neighbor  of 

node  i. 

For  instance  for  the  transmissions  from  left 
to  right  we  have 


G 


•  P(  N 


N 


when  i  -  0 

S0,  =  S  =  G0,  •  P(N0,  N, )  •  q0. 


1  =  1 


Table  8 


Total  throughput  of  the  hotspot 
in  a  star  network  (m  =  4.  L  =  9, 

K  =  1)  (or  different  values  of  d. 


I 


h  *  d  ♦  m 


So.  +  S21 

(er-pTNl)>  '  b 

d  ♦  b 


*  S  -  (-J.2  *  P ( N i .  N  2 )  ■  q.2 


2 


S.2  +  SJ2 

d  *  m  -  (  GT—P^N,^  •  b 

^2  3  “ 

h  ♦  d  ♦  b 

•  S  =  G23  •  P(N2 ,  N3)  •  q23 


i  =  k-1 


d 

m  ' 

S.j. 

%  Reduction 
from  perfect 
acknowledgments 

50 

16 

.2074 

18 

100 

20 

.2173 

14 

200 

15 

.2243 

11 

400 

19 

.2355 

7 

800 

27 

.2440 

3 

1600 

35 

.2489 

1 

VIII.  Conclusions 


h  ♦  d  ♦  m 


5k-l ,k 


/  Sk-2,k-l  +  Sk,k-1  , 

1  v.  -K-.'  ’ 


h  ♦  d  ♦  b 


=  G 


k-1 


P(N 


k-r 


Nk>  •  * 


k-1  ,k 


Equation  (39)  was  used  to  compute  through¬ 
puts  for  m  =  4,  I.  =  9  and  various  m'  in  a  star 
network.  We  found  that  the  total  throughput  of 


the  hotspot,  S_,  is  .245,4  and  the  optimum  m'  is 

27  Even  though  node  2  transmits  a  packet's 
identifier  27  times  the  increment  in  the  packet's 
length  is  only  78  bits  which  is  a  7%  increment. 
This  means  that  at  node  2  each  transmission 
carries  6.5  identifiers  (r2  =  6.5).  We  also  found 


the  r,  =  3  79  and  r3  =  3.82  which  are  4%  incre¬ 
ments.  Previously  we  saw  that  there  was  a  46% 
reduction  in  throughput  due  to  imperfect  acknow¬ 
ledgments  when  we  used  the  old  protocol  (no 
virtual  transmissions).  Here  we  see  that  with  the 
new  protocol  there  is  only  a  2.6%  reduction  in 
throughput  with  respect  to  the  result  for  perfect 
acknowledgments  (.262).  Up  to  this  point  we 
assumed  that  the  original  length  of  a  packet  is 
1050  bits  (header  =  96,  data  =  960). 

The  effect  of  the  overhead  depends  also  on 
the  ienqth  of  the  data  portion  of  a  packet.  If 
the  data  portion  of  a  packet,  d,  is  small  the 
overhead  will  be  large  and  the  throughput  will  be 
reduced  We  have  solved  Eq .  (39)  for  different 
values  of  d  7  he  results  are  given  in  Table  8. 
in  Table  8  we  also  tabulate  the  reduction  from 
the  result  lor  perfect  acknowledgments.  We  see 
from  Table  8  that  when  d  increases  the  through¬ 
put  also  increases  and  the  overhead  gets  smaller. 
Wnen  d  =  50  bits  the  overhead  at  node  two  is  36% 
hut  when  d  =  2000  the  overhead  is  only  4%.  The 
reduction  in  throughput  from  the  result  of  perfect 
acknowledgment  is  18%  when  d  -  50  but  it  is  only 
.9%  when  d  -  2000 


We  have  shown  that  the  performance  of  a 
multihop  packet  radio  system  is  significantly 
reduced  by  the  use  of  passive  acknowledgments. 
We  developed  and  analyzed  a  virtual  acknowledg¬ 
ment  protocol  which  results  in  near-perfect  ack¬ 
nowledgments  with  very  little  overhead.  Thus  we 
are  able  to  overcome  most  of  the  difficulty  due  to 
imperfect  passive  acknowledgments.  We  believe 
that  this  virtual  acknowledgment  scheme  will  also 
have  a  favorable  impact  on  time  delay  and  are 
currently  in  the  process  of  analyzing  this  prob¬ 
lem. 
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ABSTRACT 

Multihop  packet  radio  networks  operating 
under  CSMA  with  perfect  capture  have  been  ana¬ 
lyzed  with  an  exact  Markov  procedure  for  expo¬ 
nentially  distributed  packet  Lengths.  In  this 
paper  we  generalize  this  procedure  and  improve 
its  computational  complexity.  We  first  prove 
that  the  exact  expressions  of  the  Markov  model 
apply  for  arbitrary  packet  length  distributions 
with  rational  Laplace  transforms.  The  analysis 
is  shown  to  depend  oa  mean  values  only,  and  it  is 
extended  to  networks  where  the  average  packet 
lengths  depend  upon  the  destination.  Finally,  it 
is  demonstrated  how  certain  properties  of  the 
analytic  expressions  permit  the  decomposition  of 
large  networks  into  computationally  manageable 
segments . 

I.  INTRODUCTION 

Packet  radio  networks  consist  of  geographi¬ 
cally  dispersed  Packet  Radio  Units  (PRC's)  broad¬ 
casting  data  over  a  limited  distance.  In  case 
two  units  cannot  have  a  direct  connection,  inter¬ 
mediate  PRU's  will  act  as  relays,  thus  creating  a 
multihop  comnuQicat ion  network.  This  is  repre¬ 
sented  by  a  graph  with  nodes  as  sources,  desti¬ 
nations,  and  relays,  and  arcs  connecting  nodes 
which  are  withia  each  other's  range.  Several 
technologies  and  routing  protocols  have  been 
proposed,  e.g.,  [lj  and  [2],  in  order  to  estab¬ 
lish  reliable  end-to-end  paths  for  maximum  net¬ 
work  throughput.  The  need  to  evaluate  different 
protocols  and  routing  procedures  initiated  sev¬ 
eral  analytic  and  simulation  studies  on  packet 
radio  network  models.  Ia  (3]  Kleiurock  aod 
Tobagi  analyzed  single-hop  centralized  networks 
employing  Carrier  Sense  Multiple  Access  (CSMA). 
Iq  this  context  they  demonstrated  the  effect  of 
propagation  delay  and  of  hidden  terminals  (PRU's 
transmitting  to  the  central  hub  but  not  listening 
to  each  other).  In  [4]  Tobagi  considered  a 
simple  finite  state  model  for  two-hop  networks. 
In  (5|  and  [6]  multihop  packet  radio  networks 
using  a  slotted  ALOHA  protocol  were  analyzed.  In 
[7j  Boorstyn  and  Kershenbaum  introduced  an  ana¬ 
lytic  procedure  under  which  multi  hop  CSMA  net¬ 
works  with  perfect  capture  can  be  analyzed 
efficiently  m  terms  of  their  throughput  perfor¬ 
mance.  This  technique  yields  exact  results  if 
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certain  assumptions  are  made,  the  most  important 
of  which  implies  a  Markov  property  as  the  under¬ 
lying  process.  Namely,  scheduled  packet  trans¬ 
missions  are  assumed  to  form  independent  Poisson 
processes  at  each  node  and  packet  lengths  are 
exponentially  distributed.  (A  discussion  of  how 
these  assumptions  relate  to  a  more  realistic  model 
is  given  in  [7).  The  most  critical  of  these 
assumptions  is  a  long  rescheduling  time.)  This 
technique  was  used  to  evaluate  deployments  exhib¬ 
iting  various  degrees  of  connectivity  and  was 
extended  to  investigate  passive  and  active  ac¬ 
knowledgment  protocols  [8]. 

In  this  paper  we  prove  that  the  Markov  anal¬ 
ysis  presented  in  [4]  holds  for  arbitrary  packet 
length  distribution  thus  relaxing  the  exponential 
length  assumption.  Similarly  it  is  demonstrated 
that  PRU's  transmitting  (or  relaying)  different 
length  packets  to  their  neighbors  can  be  easily 
handled  via  the  same  techn^ue.  These  results 
prove  to  be  insensitive  to  the  packet  length 
distribution  under  the  perfect  capture  assumption. 
Finally,  it  is  shown  how  certain  properties  of  the 
probability  measure  on  node  sets  decompose  complex 
topologies  and  accelerate  the  computational 
procedure . 

II.  THE  NETWORK  MCDEL 

We  assume  that  a  large  aumber  of  PRU's, 
dispersed  within  a  geographical  area,  transmit 
packets  of  data  addressed  to  some  destination 
nodes.  Due  to  the  broadcast  mode  of  transmission, 
all  neighboring  node  PRU's  within  the  transmit¬ 
ter's  range  hear  the  packet  and  depending  on  its 
address,  either  discard  it  or  relay  it  until  it 
reaches  its  eventual  destination.  This  deployment 
is  represented  by  a  graph  whereby  neighboring 
PRU’s  are  connected  with  an  arc.  All  units  trans¬ 
mit  and  receive  on  the  same  channel  via  a  Carrier 
Sense  Multiple  Access  (CSMA)  protocol.  Before 
initiating  a  transmission,  a  source  (or  reiay) 
checks  whether  all  its  neighbors  are  idle.  A 
collision  may,  however,  result,  due  to  "hidden'' 
terminals  outside  the  range  at  a  PRL'  initiating 
transmission  but  interfering  w.th  the  receiving 
node  (the  receiver  is  a  common  neighbor  to  two 
non-nei ghboring  transmitters).  Collisions  are 
detected  via  a  perfect  acknowledgment  protocol 
whereby  acknowledgments  are  assumed  to  be  always 
heard  and  do  not  consume  any  network  capacity. 
Packets  °ncountering  busy  carrier  condition  or 
suffering  collisions,  are  rescheduled  for  trans¬ 
missions  after  i  long  randomized  interval. 


A  r ft t  mav  not  transmit  and  tf'.'ei'.p  '  .multane- 
■usiv.  !t  an,  nouever,  lock  its  re'e;v*»r  to  the 
:*rst  pacxet  »dU  r  e  s  seU  to  it  ana  ignott*  subse¬ 
quent  ;jLi  linn  broadcasts.  This  :s  referred  to 
i  r>  pertect  .  apture  ’  in  packet  radi.-  networks. 


;  I  I  ANALYSES  FOR  PACKET  LENGTH  DISTRIBUTIONS 
UiTH  ARBITRARY  HATIONAl  LAPLACE  TRANSFORMS 

For  a  given  node,  i,  the  total  scheduling 
process  -nit.  of  :  will  be  Poisson  with  rit-*  g 


For  networks  operating  as  above,  our  primary 
objective  is  to  assess  the  performance  ot  a 
r  niting  protocol,  as  defined  by  relaying  deci¬ 
sions,  unoer  a  given  source-des t ma t ion  traffic 
profile.  Formally,  given  the  network  topology 
i location  ot  nodes  and  connectivity*,  the  packet 
rates  and  packet  *ength  averages  per  source- 
destination  requirement  i end-f n-end  traffic;  and 
the  routing  scheme,  we  can  easily  deduce  ,  the 

rate  in  packets/ sec  that  oode  i  has  to  deliver 
successfully  to  its  neighbor  node  j.  Then,  we 

want  to  evaluate  g  ,  the  rate  at  which  node  i 
tj 

schedules  potential  packet  transmissions  to  its 
neighbor  j.  The  ratio  s  /g  denotes  the  prob¬ 
ed  ^  J 

ability  of  successful  transmission  of  a  scheduled 
packet  and  provides  a  measure  for  stability  and 
delay  since  successful  packets  must  retry  after  a 
random  delay.  Extensions  to  delay  analyses  and 
effects  of  acknowledgment  schemes  are  not  con¬ 
sidered  in  this  paper. 


5  tS 


< : ) 


where  N  is  the  set  of  neighbors  of  1.  Node  1, 
generates  packets  with  lengths  having  a  distribu¬ 
tion  with  rational  Laplace  transform.  Let  — 
be  the  average  packet  length  from  node  1. 


Let  V  denote  the  set  of  all  nodes.  At  any 
instant  in  time,  the  system  will  be  characterized 
by  the  subset  of  nodes  D  which  are  simultaneously 
transmitting.  Obvious Ly  nodes  in  D  may  not  be 
neighbors . 


The  main  theory  of  this  section  states  that, 
under  the  assumptions  above,  the  steady  state 
probability  of  D,  Q(D)  is  given  by: 


Q(D)  =  Q(D-i) 


(2) 


The  analytic  technique  described  below  will 
yield  an  exact  algorithm  for  g  under  the  fol¬ 
lowing  assumptions:  L J 


or 


Q(D)  =  Q(0)  n  -i 

UD 


(3) 


1)  Hero  propagation  delay  between  neighbors. 
This  implies  perfect  CSMA  and  does  aot  allow 
any  collisions  other  than  those  due  to 
hidden  terminals.  It  is  a  valid  approxima¬ 
tion  for  deployments  within  reasonable 
distances . 

2)  Packet  lengths  are  distributed  according  to 
any  distribution  having  i  rational  Laplace 
transform.  This  class  includes  almost  all 
distributions  used  in  practice,  either 
directly  or  as  limits  from  within  the  class. 

])  Receivers  capture  the  first  transmission 
that  reaches  them  (perfect  capture). 

Perfect  acknowledgments. 


where 

QUO  =  - ! - — - 

i  n  ^ 

DC .V  ieD  Mi 

Here  we  define  n  —  =  1  if  D  =  Q. 

icD  Mi 

These  results  are  identical  with  those  reported  in 
(7)  for  exponential  packet  lengths.  Thus,  ail 
results  in  [7]  apply  for  general  packet  lengths 
Namely,  with  JT  denoting  the  set  of  neighbors  of 

node  i  and  P(A)  the  probability  that  ail  the  nodes 
in  a  set  A  are  idle,  we  have 


a-  The  streams  of  scheduled  packets  at  the 
nodes  form  independent  Poisson  processes. 
This  assumption  ls  consistent  with  Poisson 
offered  (exogenous)  traffic  and  long  random 
rescheduling  delays.  Simulation  studies  for 
single-hop  ALOHA  (d)  indicate  that  if  re¬ 
scheduling  is  delayed  more  than  10  times  the 
packet  length,  this  critical  assumption  is  a 
va 1  id  one 

o.  Packet  lengths  are  assumed  to  be  independ¬ 
ently  reassigned  at  each  node  :n  i  path. 
This  is  consistent  with  the  rescheduling 
delay  assumptions  above. 


Pr  {successful  transmission  from  i  to 

s .  . 

-U  =  P(N  LTN  ) 

*ij  1  J 

Using  the  notation 

sp(b)  i  i  n  S 

DCB  it!)  ^ 

we  have  from  (3)  that 


(4) 


1.5  ) 


SPlV-Al 
SP(  V) 


i 


P(A) 


1  Q(D)  = 

OClV-A) 


s  e  t  o  I 


no  ties  m 


Subsets  D  in  (5)  and 
eating  nodes  as  in 
tonn  the  basis  of  an 
ilong  with  definition 

ior  given  s  rates, 

,  ij 

otherwise. 


(6)  consist  of  noQ-comcmni - 
(.3).  Equations  (.4)  and  (.6) 
iterative  procedure,  which 

fl),  determines  g  values, 
ij 

if  they  exist,  and  diverges 


D  =  D,  I'D, 


Let  Np  denote  the  set  of 

D  including  D.  We  will 
notation  in  order  to 
(global  balance)  equations 


ne  i  ghfcor  s  >t  a  1  i 

make  u_.  -t  the  :ol lowing 
write  state  transition 


We  now  proceed  to  the  proof  of  formulas  (2) 
and  1.3)  for  general  packet  length  distributions, 
which  are  different  for  each  node.  We  use  the 
method  of  stages  as  developed  by  Cox,  [ 10 ) ,  to 
decompose  arbitrary  service  distributions  to  a 
combination  of  exponential  services.  Cox  demon¬ 
strated  that  any  distribution  with  rational 
Laplace  transform,  can  be  represented  by  the 
general  Er Lang-branching  configuration  of  Figure 
1.  As  in  the  figure,  a  node  i  which  initiates  a 
packet  transmission,  activates  the  first  expo¬ 
nential  server  with  rate  mJ •  With  a  given  proba¬ 
bility  1-pj  the  packet  terminates  transmission  at 

this  stage,  otherwise  it  enters  a  second  expo¬ 
nential  server.  Since  only  one  packet  is  trans¬ 
mitted  at  a  time,  a  busy  node  will  correspond  to 
one  of  the  n  stages  being  active.  The  average 
packet  transmission  time  l/p^,  equals  the  average 

residency  within  the  stage  sequence  of  Figure  1. 


p  ^  p  ^  p  ^ 

1  .  1  2 


M 


i 

n 


(7) 


-=•  \ 


=  ‘  x-v 

u '  ■ 

'"V  a 

\-Y 


{D1,...,Dk-i...  ,Dq]  ienotes  4  state 

identical  to  E,  with  node  i  removed 
from  the  set 

(D , , . . . ,D, +i , . . . ,D  j  similarly  denotes 
l  k  n 

the  addition  of  node  i  to  D, 

k 

The  global  balance  equations  will  equate  flow 
into  state  E  to  flow  out  of  state  E.  There  are 
several  possible  ways  of  entering  or  exiting  a 
state.  For  example,  a  new  arrival  to  state 

{Dj-i,  D^,...,  Dq}  with  rate  will  yield  state 

E.  A  change  of  stage  k-1  to  k  in  {D^  ,  D0 ,  ..., 

D,  ,  +i ,  D,-i,  ...,  D  }  with  rate  P.1  ,/u.1  ,  will 
k-1  *  k  *  *  a1  k-1  k-1 

result  in  E.  Accounting  for  all  possible  transi¬ 
tions,  and  denoting  the  stationary  probability  of 
state  E  by  Q(E),  we  have: 

*  gi  d2 . V 

i£Dj 

*  *  f*  Pfc-J  Pfc-1  Q(D1 . Dk-i+i-  V1 . V 

k=2  ieDk  K  1  K  1  K  1  n 

n 

*  l  l  CI-Pk)  Mu  Q(D, . O.+i . D  )  = 

k=l  i<ND  *  k  i  ic  □ 

n 

=  {  z  pi  +  l  l  +  l  g  i  Q(E)  (8) 
ieDj  k=2  ieDk  i*ND 


>  5fcq«  Otcan’oomtcn  ‘or  Nod*  *. 


The  rational  Laplace  transform  class  includes  or 
approximates  all  distributions  found  in  practice. 
As  an  example  of  the  flexibility  of  the  method, 
note  that  a  single  exponential  server  models 
exponential  packet  lengths  and  that  a  series  of 

servers  (P*=l)  approaches  the  constant  packet 
length  case  as  the  number  n  increases. 

The  system  with  decomposed  nodes  as  above, 
is  a  Markov  process  with  Poisson  scheduling  and 
exponential  servers.  The  state  of  the  system 
must  contain  information  on  busy  (transmitting) 
nodes  and  their  respective  current  stage  of 

transmission.  Let  D,  be  the  set  of  busv  PRC's  at 
k 

stage  i.  The  network  state  E  is  the  collection 
if  subsets  The  set  D  of  busy  PRC’s  regard¬ 
less  of  stage  is  given  by 


We  observe  that  the  global  balance  equations  (3) 
can  be  decomposed  into  three  sets  of  consistent 
local  balance  equations: 


8i  Q(Drl  ,D2’ '  '  ' 

•V  = 

Q(E) 

Vi  e 

Dj  (9) 

v.  Pk-1  Q(V- 

••’°k-i+i 

•V1" 

••-V 

=  Mk1-Q(E) 

VieD.  , 
k 

2  <  k  <  n 

£  <»<>  Mlk 

QCDj . 

•Vi.. 

...Dn) 

(10) 

=  gj  Q(E) 

Equations  (9)  and  (10)  equate  the  flow  into  E  due 
to  arrivals  at  stage  k  of  node  i  to  the  flow  out 
of  E  due  to  departures  from  the  same  stage  of  the 
same  node.  Equations  (11)  are  obtained  by  con¬ 
sidering  an  additional  stage,  the  idle  stage 


ebv 

node  i  i 

ls  idle  but 

.  permitted 

to  t 

ransni t , 

nd 

Then  * 

ie  equate 

the  flow  : 
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e  an 
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uat ions , 

con- 

sistent  with  til)  and  summing  up  to  the  global 
balance  equation  (8).  For  similar  reasoning  on 
local  balance  equations,  see  ( 1 1 )  and  [12].  In 
412|  similar  processes  are  characterized  within 
the  context  of  Markov  fields  for  spatial  pro¬ 
cesses. 

Fro*  the  local  balance  equations,  it  easily 
follows  that  the  stationary  state  probabili ties 
QlE)  possess  product  form  solutions,  namely 


WtDj.Dj,. 


. ,D  )  =  Q(«)  n  —  x 


g  P1  P1 

®i  1  2 


If  ^(D)  denotes  the  steady-state  probability  that 
the  nodes  m  D  are  busy  regardless  of  the  stage 
of  transmission,  Q(D)  will  consist  of  the  sum  of 

all  states  (D , , D- , . .  .  , D  }  which  include  a  node 
12  n 

i  at  some  stage  k,  leD^-  Thus 


Q(D  ,D. , . . . »D  ) 
il  n 


P*P}. . .P  1 
.  1  1  n-1 


All  partitions  such  that 

D  =  D  U  0*  L'  ...  U  D 
1  2  n 


P1  P1?1 
1  1  12 
Q(D)=Qio)  n  g.  4  +  —  .♦ 

i£D  M1  M2  M3 

and  from  (7) 


Q(D)  =  Q(4>)  n  —  q.e.d. 
ieD  Mi 

Thus  the  steady  state  probabilities  of  the  set  of 
busy  nodes  has  a  product  form  solution,  and 
depends  on  the  average  packet  length  (possibly 
different  for  each  node)  for  packet  leogth  dis¬ 
tributions  with  ratioaal  Laplace  transform. 
These  distributions  need  not  be  the  same  for  each 
node.  The  aumber  of  stages  n  used  in  the  proof 
corresponds  to  the  maximum  number  of  stages  in 
the  network.  For  nodes  with  smaller  n,  the 
branching  probability  P^  can  be  set  to  zero  to 
truncate  stages. 

IV.  NODES  TRANSMITTING  DIFFERENT  PACKET 
LENGTHS  TO  EACH  NEIGHBOR 

In  the  previous  section,  we  assumed  that 
each  node  l  schedules  packet  transmissions  at  an 
aggregate  rate  g  and  with  average  length  I/m. 


A  slight  modification  extends  the  analysis  to 
scenarios  whereby  j  node  i  transmits  packets 
different  average  length  to  each  o I  its  neighbors 
J  • 

Let  g  and  1/u  be  the  scheduling  rate  and 
l  j  i  j 

average  packet  length  for  the  i  to  j  transmission. 
We  keep  the  same  structure  as  before  by  breaking 
the  node  l  into  a  set  of  "micronodes one  for 
each  neighbor.  Micronodes  are  connected  in  our 
topology  if  they  can  hear  each  other  Obviously 
micronodes  belonging  to  the  same  node  i  are  fully 
connected  and  so  are  raicronodes  belonging  to  nodes 
connected  iQ  our  initial  topology.  As  an  example, 
a  five  aode  chain  will  be  decomposed  as  in 
Figure  2.  Since  nodes  2,  3  and  4  are  transmitting 
to  two  neighbors  they  are  decomposed  into  two 
microaodes.  By  applying  the  results  of  the 
previous  section,  we  have  that  for  a  successful 
transmission  from  2  to  3, 


—  =  Pfl  2  2'  3  30  =  SP(4^'. 

j  Sp(V) 


SP(4 , 4 '  ,5)  =  l  *  —  *  —  *  — 

^3  M45  ^54 


- {  j') 
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Fiq  2  Micronode  Decomposition  of  o  Five-Node  Chum 


Now,  define 


*4?  *45 


and  Gr  =  - —  . 

5  Mca 


in  other  words  let  G^  be  the  average  normalized 
scheduling  rate  of  node  i. 

gii 

G.  =  I  (13) 

l  .  M  ■ 

J 

It  follows  that  for  this  example  results  are 
identical  as  in  the  original  topology  before 
decomposition  but  with  replaced  by  (13). 

In  general  consider  SP(B*)  where  B*  is  the 
set  of  micronodes  associated  with  the  set  of  nodes 
B.  Let  node  i  £  B  and  denote  its  micronodes  by  ij 
where  j  are  neighbors  of  i.  We  use  the  relation 
[5], 


=  SPi  B-i  !  *  —  SPlB-N  )  14) 

r  •' r  micro nodes, 

SPiB*)  =  SPtB*-;i,jl)  *  — 1  SP(  B*-N*  .  ) 

Note  that  because  of  the  connect  ivity  of  micro- 
nudes,  S*  is  the  same  for  all  j,  namely  the  set 

ot  micronodes  of  N  ,  N*.  Let  1*  be  the  set  of 
1  i 

micronodes  of  i.  Then  proceeding  recursively  we 
:i  a  ve 

S  P  l  B* )  =  SP(B*-i*)  ♦  1  — ^  SP(B*-N*)  (15) 

i  Mij  1 

Using  (13-  and  repeating  the  above  process  we 
find  that  equation  (6)  still  holds  for  the  origi¬ 
nal  nodes,  but  with  replaced  by  given  in 

13).  In  the  special  case  where  all  packets 
transmitted  by  i  have  average  length  1/m^»  then 


As  a  special  case  with  C  containing  one 
node  i 

SP(V)  =  SP<V-1)  ♦  G  SP  ( V-N  J  -20) 

i  i 

Sets  N  aod  N_  above ,  denote  the  sets  of 

1  u 

neighbors  of  i  and  all  nodes  in  D  respectively. 
Finally,  if  C  is  a  cut,  t.e.  a  set  of  nodes  winch 
when  removed  decomposes  the  network  into  two 
isolated  subsets  A  aod  B.  it  follows  from  (16) 
that  for  all  independent  sets  D^C 

SP(V-C-SD)  =  SPCA-N  )  SP(B-ND)  (19) 

aod 

SP(V-C)  "  SP(A)  SP(B) 

As  an  example,  consider  the  10  node  network 
illustrated  in  Figure  3.  By  choosing  C  =  {5,6}, 
the  network  is  decomposed  into  three  subsets,  A,  B 
and  C  with  A  and  B  being  totally  isolated  from 
each  other.  Applying  (17)  and  (19)  we  can  reduce 
the  computations  needed  to  evaluate  SP(V)  as 
follows : 


V.  COMPUTATION  OF  SP(A) 


SP(V)=SP( 1 ,2,3,4,5,6,7,8,9,10)= 


As  seen  from  equations  (4),  (5)  and  (6) 

evaluation  of  s  ,/g  requires  an  efficient  way 
ij  *ij 

ot  computing  SP(A)  expressions  for  various  sets 
of  nodes.  Recall  that  SP(A)  stands  for  sums  of 
products  of  rates  on  all  possible  independent 

subsets  of  A.  By  an  independent  set  we  denote  a 
set  of  nodes  which  do  not  communicate  (i.e.  are 
not  neighbors ) . 


=SP( 1 ,2,3,4,7,8,9,10) 


=SP(1,2,3,4)SP(7,8.9, 10) 


♦  G5  SP( 1 , 4 , 9 , 10) 

♦  G6  SP( 1,2,7,10) 

-  G. G .  SP(1,10) 
o  o 

+  G5  SP(1 ,4) -SP(9, 10) 


s  p  i  a  )  =  i  no. 

DCA  ieD  1 


+  G6  SP(1,2)-SP(7,10) 


A  straightforward  algorithm  to  evaluate 
SP(A)  would  require  identification  of  all  inde¬ 
pendent  subsets  of  A,  a  problem  dual  to  identifi¬ 
cation  of  all  cliques  in  a  graph  and  thus  NP- 
complete  (a  clique  is  a  fully  connected  subset  of 
a  graph).  It  is  however  possible  to  handle  con¬ 
siderable  size  networks  by  using  several  proper¬ 
ties  of  the  SP(A)  function.  We  summarize  some  of 
them  and  demonstrate  via  an  example  hew  to  decom¬ 
pose  a  network  into  smaller  segments. 


If  two  subsets  of  nodes  A  and  B  are  isolated 
from  each  other,  thea  in  [7]  it  is  shown  that 


SP(AUB)  =  SP( A)  SP(B)  (16) 

If  C  is  an  arbitrary  subset  of  V,  it  can  be 
shown  that 


SP'V)  =  SP(V-C)  ♦  j  n  G. 

D  _C  ieD  1 


•  SPlV-C-Mn)l 
D  l 


■(17)* 


*  G,G,  SPU.10) 

J  o 

We  can  now  proceed  to  direct  evaluation  of  sums- 
of-products  for  subsets  with  4  nodes  at  the  most 
mstead  of  10  initially. 
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VI .  CONCLUSIONS 


This  result  is  due  to  Mr.  W.  Chen,  of  the  Bell 
Telephone  Laboratories  and  the  Polytechnic 
Institute  of  New  York  and  can  be  proven  by 
considering  all  terms  not  including  nodes  in  C 
and  then  all  terms  involving  different  inde¬ 
pendent  subsets  of  C. 


We  have  shown  how  our  work  on  an  analysis  of 
multihop  packet  radio  networks,  originally  done 
for  exponential  packet  lengths,  can  be  extended  to 
packet  length  distnbut ions  with  rational  Laplace 
transforms.  Thus  the  results  previously  obtained 
can  be  used  in  this  more  general  case  with  only 


t.  r.e  tvprjtje  p  set  length  being  re;uire]  It  was 
jiio  shewn  tr.it  the  rui.-.-  r  .'*ngth  list  r  imit  ions 
transmitted  by  a  sit:;*-  .i  ••  .**  '  •  :  lfteient  ueigh- 
’'ors  need  not  be  th*  .  •..»■  .  tiLy.  i  method  was 

iesviioei  to  simplilv  .’.c  witting  t  the  analytic 
expressions  tor  Large  networks. 

V I L .  REFERENCES 

1.  Kahn,  R.E  .  ’The  Organization  at  Computer 

Resources  into  a  packet  Radio  Network,''  IEEE 
r  rar.  sac  t  ions  on  Cutnnmni<  iti^ns.  Vo  I.  COM-25, 
Jan.  19",  p  p  .  1  o  9  -  1  "  8  . 

2.  Oitman,  I.,  R.  Van  iiyxe  and  H.  Frank, 
"Routing  m  Packet-Swi toning  Broadcast  Radio 
Networks,"  IEEE  Transactions  on  Communica¬ 
tions,  Voi.  COM-24,  Sept.  1976,  pp .  926-930. 

;  Kleinrock,  L.  ,  and  F.  Tobagi,  "Packet 

Switching  m  Radio  Channels,  Part  I  and  II," 
IEEE  Transactions  on  Communications,  Vol. 
CcM-23,  Lee.  i'9'5,  pp .  1400-1433. 

.  Tobagi,  r  ,  "Analysis  ot  a  Two-Hop  Central¬ 
ized  Packet  Radio  Network,  Part  II:  CSMA," 
IEEE  Transactions  on  Commumcat  ions ,  Vol. 
TjM-28,  Feb  1980 ,  pp .  208-216. 

,  kle ; aro: k ,  L..  and  J.  Silvester,  "Optimum 

Ti  insraissi-^n  Radii  for  Packet  Radio  Networks 
r  «hy  Six  is  a  Magic  Number,"  Proceedings 
•  r  'fie  NT  C  78,  Birmingham,  AL,  Dec.  1978. 

I  .  and  I.  SiLverster,  "An  Iterative 
f.eme  r  _» r  Performance  Modeling  of  Slotted 
C  HA  Parget  Radio  Networks."  Proceedings  of 
•he  ’  82,  Phi  lade  iphia ,  PA,  June,  1982, 

:>p  r>'.t .  \  -  bC  .  5  . 

•iO'-rstyn,  R.,  and  A.  Ke rshenbaum ,  "Through¬ 
put  Analysis  <f  Multi-Hop  Packet  Radio  Net¬ 
works,"  Proceeding  of  the  ICC  '80,  Seattle, 
"A .  June  1980,  pp .  13.6.1-13.6.6.  (An 
expanded  version  of  this  paper  has  been 
a ;i  epted  tor  publication  in  the  IEEE  TraQS- 
i-  '  *  -ns  n  '-(fnmun  icat  ions  .  ) 

•  Buorstyn,  R  ,  A.  Kershenbaum,  and  V.  Sahin, 
"A  New  .Acknowledgment  Protocol  for  Analysis 

f  Mult;  fro  Packet  Radio  Networks,"  Pro- 
•ee-ungs  i  the  OO’MPoN  82,  Washington,  DC, 
Sept.  1>82,  pp  JSj-392. 

1.  "Queueing  Systems,"  Volume  2,  L.  Kleinrock, 
John  Wiley  Si  Sons,  New  York,  1976,  p.  374. 

10  "The  Theory  of  Stochastic  Processes,”  D.R 
J.,x,  md  H.D.  Miller,  Chapman  and  Ha  Li, 
London,  !96r> 

•  1  ■  Basket’.,  f  .  M.  Chandv,  R.  Muntz,  and  J . 

Pilacius,  "Open,  Closed  and  Mixed  Networks 
ot  queues  with  Different  Classes  of  Cus¬ 
tomers,"  Journal  of  the  ACM,  Vol.  22,  No  2, 
April  1975,  pp  248-260 

12.  "Revers-  litv  and  5tothastir  Networks."  F  P 


R.  Multiple  Arecs.s  i  u  uniques  with  Arbitrary 

Packet  l.emjth  Distribution-’ 

It .  1  Introduction 

In  out  original  multihop  packed  radio  analysis,  we  assumed  expo¬ 
nentially  distributed  packet  lengths.  We  have  been  able  to  generalize 
the  analysis  lot  packet  lengths  having  densities  formed  by  the  positive 
sum  ot  exponential  terms  (see  Appendix  C).  In  our  analysis  we  as¬ 
sumed  that  propagation  delays  among  neighboring  PRU's  are  negligible. 
Thus  in  a  Carrier  Serving  Multiple  Access  (CSMA)  mode  of  operation, 
collisions  may  occur  due  to  the  ''hidden  terminal"  phenomenon  only 
(i.e.,  two  non-communicating  PRU's  schedule  packet  transmissions  to  a 
common  neighbor  simultaneously). 

CSMA  analyses  incorporating  the  effects  of  propagation  delays  have 
been  reported  extensively  in  the  literature  for  single-hop  networks 
i.e.,  all  PR l  hear  i-ach  other)  and  lixed  packet,  sites.  As  a  first 
step  in  generalizing  these  results,  we  studied  single-hop  multiple  access 
protocols  with  non-tixed  packet  lengths.  Although  our  main  thrust,  is 
on  CSMA  packet,  radio,  we  also  derived  formulas  for  pure  ALOHA  and 
CSMA  with  collision  detection  (CSMA  CD).  I  he  former  was  a  necessary 
stef)  in  order  to  demonstrate  the  impact  ot  packet  length  distribution  on 
the  simplest  multiple  access  method,  whereas  the  latter  is  a  straight¬ 
forward  extension  of  pure  CSMA  and  is  especially  popular  in  local 
networking  environments.  \ote  that  the  pure  ALOHA  case  was  studied 
pi  evmusly  (  >  |  ,  whereas,  no  extension  has  been  reported  on  CSMA  to  our 
knowledge.  I  he  CSMA.  CD  lesult  is  so  simple  that  it  may  already  be 
known . 
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In  what  follows,  wo  summarize  the  variable  packet  length  analyses 
in  pure  ALOHA,  CSMA  and  CSMA/CD.  In  all  cases  we  assumed  infi¬ 
nitely  many  Poisson  sources  and  Poisson  aggregate  scheduling  pro¬ 
cesses,  with  rates  s  and  g  packets/sec  respectively.  Packet  lengths 
are  distributed  arbitrarily. 

B.2  Pure  ALOHA 

Referring  to  Figure  9,  we  consider  a  transmission  of  length  Y 
(shaded).  This  transmission  will  be  successful  if  a)  no  other  packet  is 
transmitted  in  Y  seconds  and  b)  no  previously  transmitted  packet  is 
still  transmitting.  We  are  assuming  zero  capture.  Calling  these  prob¬ 
abilities  Pa  and  P^,  we  obtain  s  =  gPgP^-  But 

00 

P  =  J  e"gy  fY(y)  dy  =  MY(-g) 
a  0 

where 

00 

MY(g)  =  J  egy  fY(y)dy 
0 

is  the  moment  generating  function  of  Y  and  fY(y)  is  its  density.  P^  is 
found  by  considering  the  '1'  second  interval  prior  to  the  transmission  in 
question.  Assume  transmissions  in  that  interval  occur  Tj  seconds 
before  the  start  of  our  test  transmission  and  have  length  Y..  Then 

Pb  =  lim  P( all  T-  >  Y.) 

T  *» 


But  the  number  of  transmissions  in  T  is  Poisson  and  all  are  identically 
distributed  and  independent.  Therefore 


P(all  T.  >  Y.)  =  I  [ P(T.  >  Y.))k  e'gT 

1  1  k=0  11  K 

-gT[l-P(Ti  >  Y.)]  -gTP(Tj  <  Yj) 
=  e  =  e 


Here  is  uniform  in  the  interval  (0,T)  and  Y.  is  distributed  as  Y. 


They  are  independent.  Thus 


P(T.  <  Y.)  =  4.  /  [1  -  F Y( t ) ]  dt 

1  1  1  0  1 


TP(T.  <  Y-)  -»  J  [1  -  F Y ( t ) ]  dt  =  E(Y) 

1  1  0  Y 


Here  Fy(y)  is  distribution  function  of  Y  and  E(Y)  its  expectation. 
Finally  we  have 

s  =  gMy(-g)  e"gE('Y-)  . 

Note  that  if  Y  is  fixed  then  s  =  ge”^gy  as  it  should. 

But  Y  is  the  length  of  the  transmitted  packets.  Condition  (b) 
above  does  not  involve  the  length  of  the  transmitted  packet  Y.  But 
condition  (b)  does!  Longer  packets  are  more  likely  to  suffer  a  colli¬ 
sion.  Let  X  be  the  length  of  the  offered  (or  successful)  packets.  Y 
should  in  a  sense  be  larger  since  longer  packets  are  retransmitted  more 
often.  Due  to  condition  (a)  alone  a  packet  of  length  x  will  be  suc¬ 
cessfully  transmitted  with  probability  e~gx  and  requires  an  average  of 
egx  transmissions  to  be  successful.  Thus 


fy(y)  =  egy  fx(y)/Mx(g). 


26 


and 


pa  =  1/Mx(g)  • 


Thus 


a  -gM  ,(g)/M  (g) 
S  =  e 


This  also  reduces  to  s  =  ge"^x  when  M  (g)  =  e^x.  This  result  has 
been  already  established  [5]  but  is  derived  here  in  a  different  manner. 


B  .3  CSMA 

Refer  to  Figure  10  for  CSMA.  After  an  idle  period  a  packet 
scheduled  with  rate  g  is  transmitted.  The  packet  lasts  for  X  seconds. 
In  the  propagation  time  a  after  transmission  any  other  scheduled  packet 
can  also  be  transmitted  thus  causing  a  collision.  We  assume  X  >  a. 
Again  we  assume  zero  capture.  If  no  such  packet  is  transmitted,  then 
the  original  transmission  is  successful,  lasts  for  X  seconds,  and  is 
followed  by  an  a  second  period  to  clear  the  channel  and  the  idle  state 
resumes.  Thus  the  successful  rate  is 


But 


s  =  ge  ga  P(channel  is  idle). 
P(channel  is  idle)  = 


_ 1Z2 _ _ 

1/g  +  a  +  E(Z) 


where  1/g  is  the  average  idle  time  and  Z  is  the  busy  period  exclusive 
of  the  last  a  seconds. 

To  evaluate  E(Z)  we  denote  the  times  of  the  transmissions  of 
interfering  packets  as  Tj  and  their  lengths  as  Xj.  The  number  of  such 
packets  is  exponential.  Collisions  depend  only  upon  the  propagation 
time  a  and  not  on  the  length  of  the  transmitted  packet  as  in  ALOHA. 


Z  =  max  {X,  Tj  +  X{}. 


and  Fz(z)  =  Fx(z)  I  [Fw(z)]  e‘93 


where  W  =  Tj  +  X.,  T.  is  uniform  in  (0,a)  and  is  independent  of  Xj, 
which  is  distributed  as  X.  Thus 


Fz (z)  =  Fx(z)  e 


-ga[l  -  Fw(z)] 


E(Z)  =  ;  [1  -  F7(Z)]  dz. 

0  ^ 


The  last  two  equations  can  be  used  to  find  E(Z),  although  not  easily. 


Finally 


1  +  ga  +  gE(Z^ 


B  .4  CSMA/CD 

Refer  to  Figure  11  for  CSMA/CD  (unslotted).  Here  collisions  are 
detected  and  transmissions  aborted.  Again  packets  are  scheduled  with 
a  rate  g.  After  an  idle  period  a  packet  is  transmitted.  If  no  packets 
are  transmitted  in  the  next  a  seconds  that  packet  is  successful.  After 
a  seconds  to  clear,  the  channel  returns  to  idle.  A  scheduled  packet 
can  be  transmitted  in  the  first  a  seconds  and  will  cause  a  collision. 
But  it  will  be  aborted  a  seconds  after  the  original  transmission.  The 
original  transmission  will  be  aborted  a  seconds  after  the  start  of  the 
colliding  packet.  Thus,  as  before, 
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FIGURE  10 


CSMA 
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In  a  previous  paper  [1]  we  developed  an  analytical  model  of 
multihop  packet  radio  networks  operating  under  CSMA  (Carrier  Sense 
Multiple  Access),  Poisson  arrivals,  perfect  acknowledgements  (i.e. 
acknowledgements  always  heard  and  not  taking  any  channel  capacity), 
perfect  capture,  and  zero  propagation  delay.  This  yields  a  Markov 
model  of  the  system  and  the  relationship 


s. . 

=  P(N.U  N.)  (1) 

g.j  J 

where  s..  is  the  required  successful  rate  (in  packets  per  unit  of 
transmission  time)  of  transmissions  from  node  i  to  node  j,  g^  is  the 
scheduled  rate  of  traffic  from  node  i  to  node  j,  P(A)  is  the  proba¬ 
bility  that  all  nodes  in  set  A  are  idle  (and  says  nothing  about  other 
nodes),  N.  is  the  neighborhood  of  node  i  (i.e.,  node  i  together  with 

all  nodes  which  can  hear  its  transmissions),  and  N.  is  the  neiqhbor- 

J 

hood  of  node  j.  Some  of  the  above  assumptions  about  the  network 
model  can  be  relaxed  to  include  a  more  general  class  ol  networks,  but 
we  use  the  simple  model  above  for  the  sake  of  clarity.  It  is  unlikely 
that  such  generalizations  would  significantly  affect  the  computational 
procedure  described  below. 

We  are  given  r..,  the  required  end-to-end  traffic  fiom  i  to  j,  for 
all  nodes  i  and  j.  We  assume  a  routing  has  been  done  yielding  hop- 


by-hop  requirements 


s. . . 

‘1 


We  are  also  given  the  connectivity  for  each 


node.  Equation  1  states  that  a  transmission  will  be  successfully 
received  once  it  is  scheduled  if  it  is  transmitted  and  it  no  neighbor  of 


2 


the  receiver  is  transmitting  at  the  time  transmission  begins.  (Our 
assumption  of  perfect  capture  allows  us  to  ignore  interference  from 
any  subsequent  transmissions.) 

In  our  previous  work  [1|  we  have  shown  that  the  probability  of 
a  set  of  nodes.  A,  being  idle  is  given  by 


P(  A )  =  1  n  G.  /  1  n  G. 
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(2) 


where  A  is  the  complement  of  A,  (i.e.  all  nodes  not  in  A),  1  is  an 
independent  set  (i.e.  nodes  which  all  cannot  hear  each  other),  and  G. 


is  the  total  scheduled  rate  from  node  i  (i.e.  G.  is  the  sum  on  j  of 

l  ’ 

g..).  We  refer  to  the  quantities  in  Equation  2  as  sums  of  products  on 
a  set  A  and  denote  it  by  SP(A).  We  can  thus  rewrite  (1)  as 


s. . 

JJ 

g.i 


SP(V-(N.U  Nj) ) 
SP(V)~ 


=  r..  (G) 


(3) 


where  G  -  (G,,  G2,...GM)  and  G.  =  Iq...  Then  we  can  write, 
1  ^  N  i  )  lj 


s. . 


gii  = 

■j 


(4) 


We  then  can  iteratively  compute  new  estimates  of  the  g..  given 

current  estimates  of  these  quantities  and  hence  the  G. .  Our  previous 

experience,  reported  in  |1|  showed  that  if  the  network  can  support 

the  given  s..  then  the  iteration  will  converqe  it  we  start  with  q..=s.  . 
J  ij  ^  i]  1 1 

Mote  that  g..  >  s... 

i)  g 

Gonceptually ,  then,  the  problem  is  solved.  One  need  only  set 


up  the  expressions  for  the  E..(G)  and  iterate  (4)  for  each  i  and  j 
until  successive  estimates  for  the  q..  converge  to  within  some  given 


tolerance.  This  was  done  for  small  networks  and  for  larger  networks 
with  symmetric  traffic  and  connectivity ;  results  using  this  procedure 
are  reported  in  [ 1 ] - [4 ) . 

For  larger  networks  without  symmetry,  however,  several  signifi¬ 
cant  problems  are  encountered.  Foremost  among  these  is  that,  in 
general,  the  number  of  terms  in  SP(V)  grows  exponentially  with  the 
number  of  nodes  in  V.  Specifically,  the  number  of  terms  in  SP(V) 
equals  the  number  of  independent  sets  in  V.  In  the  worst  case, 
when  all  nodes  are  independent  of  one  another,  SP(V)  includes  a  term 
corresponding  to  each  of  the  2^  subsets  of  V.  While  such  a  case 
cor:  esponds  to  an  unrealistic  situation  where  no  nodes  can  communicate 
with  one  another,  many  real  networks  are  loosely  connected  and  thus 
contain  a  very  large  number  of  indepent  sets.  Indeed,  this  number 
will  grow  exponentially  with  N  if  connectivity  (average  nodal  degree) 
does  not  increase  with  N.  This  can  be  seen  by  examining  any  regular 
topology;  e.g.,  a  grid. 

Even  if  the  number  of  terms  in  SP(V)  can  be  controlled,  say  to 
grow  quadratically  with  N,  there  are  still  several  problems.  Tirst, 
SP(A)  must  be  generated  not  only  for  A=V,  but  for  many  other  sets 
as  well.  In  particular,  each  F..  will  have  a  different  SP(A)  in  the 
numerator.  Also,  F^  must  be  evaluated  not  once  but  many  times 
before  the  iteration  converges.  Finally,  the  process  of  generating 
the  Fjj  must  be  automated  in  order  to  avoid  tediously  having  to  man¬ 
ually  input  the  individual  expressions. 

We  now  describe  a  procedure  whch  overcomes  ali  these  problems. 
We  first  observe  that  in  the  pathological  case  of  all  nodes  independent, 


SP( V )  can  be  written  as 


(5) 


r« 


N 

SP(V)  =  n  (1+G.) 

i=l  1 


It  this  expression  were  expanded  directly  into  a  sum  of  products  form 

N 

it  would  indeed  contain  2  terms  as  was  remarked  above.  As  it  is 
written,  however,  only  N  additions  and  N-l  multiplications  are  re¬ 
quired  for  its  evaluation.  While,  again,  the  situation  where  all  nodes 
are  independent  of  one  another  is  unrealistic,  the  observation  is 
nevertheless  important.  It  is  possible  to  make  the  evaluaton  of  SP(A) 
tractable  by  grouping  together  common  subexpressions  and  using  the 
distribution  of  multiplication  over  addition  to  reduce  the  number  of 
arithmetic  operations.  This  is  the  basis  for  the  procedure  described 
below . 

Every  independent  set  either  contains  a  given  node,  i,  or  it 
does  not.  If  the  set  contains  node  i  then  it  does  not  contain  any 
neighbors  of  i.  We  can  thus  evaluate  SP(A)  by 

SP(  A )  =  SP(A-i)  +  G.  *SP(  A-PT  )  (6) 

SP(4>)  =  1 

where  i  is  any  member  of  A. 

I A  I 

This  recursive  expansion  of  SP(A)  can  generate  up  to  2 
terms,  where  |A|  is  the  cardinality  of  A,  but  fewer  terms  will  be 
generated  in  general  since  A-N.  will  eventually  become  $  and  terminate 
the  recursion.  A  more  dramatic  reduction  in  the  number  of  generated 
terms  will  occur  if  we  recognize  sets,  A,  which  have  already  appeared 
and,  hence,  for  which  SP(A)  is  already  known.  In  this  case,  the 
known  value  of  SP(A)  is  used  directly  and  no  further  recursive 
expansion  is  required.  This  corresponds  exactly  to  factoring  out  a 
common  subexpression. 


-# 


* ; 


«• 
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As  an  example  of  how  this  procedure  works,  consider  the  net¬ 
work  shown  in  Figure  1.  We  begin  by  computing  SP(V).  The  recur¬ 
sive  expansion  of  SP(V)  is  shown  in  Figure  2.  Each  node  in  Tigure  2 
corresponds  to  an  expansion  using  (6)  Thus,  for  example,  the  root 
of  the  tree  corresponds  to 

SP( {12345} )  =  SP( { 12345 }  -  { 1 }  )  +  G. -SP( { 12345}-N x  )  (7) 

where  N  x  =  { 123 } .  The  G.  represent  the  same  term  appearing  in  (6). 
In  each  case  we  expanded  SP(A)  about  the  lowest  numbered  node  in 
set  A.  This  was  done  for  simplicity.  In  fact,  the  selection  of  which 
node  to  expand  about  is  significant  and  will  be  discussed  later. 
Nodes  followed  by  an  asterisk  in  Figure  2  correspond  to  terms  which 
have  already  been  generated  and  therefore  need  not  be  expanded. 
As  can  be  seen,  there  are  6  distinct  nodes  in  Figure  2. 

The  first  six  rows  of  Table  1  correspond  to  the  six  distinct 
nodes  in  Figure  2.  The  seventh  row  of  Table  1  and  the  column 
labeled  NEXT  will  be  explained  below,  as  will  be  the  procedure  for 
generating  Table  1.  The  first  five  columns  of  the  first  six  rows  in 
Table  1  represent  the  recursive  expansion  of  SP(V)  as  shown  in 
Figure  2.  For  example,  Row  1  in  Table  1  corresponds  to  Equation 
(7)  above  where  Terms  1  and  2  are  the  appropriate  SP(A) 
and  GMULT=G. . 

l 

If  the  column  labeled  VALUE  is  used  to  hold  numerical  values  of 
SP  for  the  sets  given  in  the  column  labeled  SET,  and  GVAL  ( i ) 
contains  the  current  numerical  value  of  G^ ,  the  current  estimate  of 
the  scheduled  rate  from  node  i,  then  Table  1  can  be  used  to  obtain 
the  numerical  value  of  one  set  given  values  of  others.  Specifically, 
VALUE(5)  corresponds  to  SP(4>)  and  is  equal  to  1  by  definition.  This 
value  can  then  be  used  to  obtain  VALUE(4): 
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VALUE(4)  =  VALUE  (TERM1(4))  +  GVAL  (GMULT(4)) 

•  VALUE  (TERM2(4)) 

=  VALUE(5)  +  GVAL(5)  *  VALUE(5) 

=  1  +  g5 

which  is,  of  course,  SP({5}). 

It  is  necessary  that  the  rows  of  Table  I  be  evaluated  in  the 
correct  order  so  that  the  appropriate  VALUE'S  are  available  when 
they  appear  as  terms  in  larger  expressions.  While  this  ordering  is 
not  unique,  there  is  a  precedence  of  rows  imposed  by  Table  1.  This 
precedence  is  shown  in  Figure  3.  An  arc  leads  from  node  i  to  node  j 
in  Figure  3  if  row  i  must  be  evaluated  before  row  j .  This  graph  is 
obtainable  directly  from  Table  1  using  the  columns  labeled  ROW, 
TERM1,  and  TERM2.  Physically,  we  represent  Figure  3  by  the  out¬ 
ward  adjacency  of  each  node  (i.e.  a  list  of  the  endpoints  of  arcs 
leading  out  of  it)  and  the  inward  degree  of  each  node  (the  number  of 
arcs  leading  into  it).  With  this  information  one  can  do  a  topological 
sort  of  the  nodes  (c.f.  Knuth[5]),  i.e.,  we  sort  the  nodes  by  current 
inward  degree,  continually  reducing  the  inward  degree  by  eliminating 
nodes  already  sorted.  Topological  sorting  is  a  simple  procedure 
whose  running  time  is  proportional  to  the  number  of  arcs  in  the 
precedence  graph  and  which  produces  a  list  of  nodes  in  precedence 
order.  In  this  case,  the  resulting  list  would  be  5, 7, 4, 3, 6, 2, 1 .  Note 
that  each  row  in  Table  1  can  be  dependent  upon  at  most  two  others 
and  so  the  number  of  arcs  in  the  precedence  graph,  the  size  of  the 
adjacency  list,  and  the  run  time  of  the  topological  sort  are  all  linear 
in  the  number  of  rows  in  Table  1. 


The  column  labeled  NF.XT  gives  the  next  row  to  evaluate  as 
given  by  the  topological  sort.  Row  5,  <•  orresponding  to  <j>  is  of  course 
first;  this  fact  is  recorded  in  a  var  ible  called  FIRST.  Thus,  Table 
1  allows  us  to  evaluate  SP(V)  simpiy  by  making  a  single  pass  through 
Table  1  and  doing  one  addition  and  one  multiplication  per  row. 

We  have  thus  solved  the  problems  of  how  to  automatically  generate 
a  functional  expression  for  SP(V),  how  to  avoid  having  to  reevaluate 
common  subexpressions,  and  how  to  simplify  the  iterative  calculation 
of  the  G. .  With  respect  to  the  last  point,  note  that  Table  1  is  set  up 
only  once.  The  evaluation  of  the  SP(A)  is  then  straightforward, 
requiring  only  K  additions  and  K  multiplications,  where  K  is  the 
number  of  rows  in  Table  1.  Indeed,  one  generation  of  Table  1  suffices 
for  the  evaluation  of  the  Gi  for  many  values  of  s..,  which  would  be 
useful  in  analyzing  variations  in  routing  and  offered  traffic. 

We  require  not  only  SP(V)  but  also  SP(V-(N.  y  N.))  lor  all  i  and 
j  which  are  neighbors  and  have  s^  >  0.  We  represent  all  these  other 
SP(A)  in  Table  1  in  exactly  the  same  way  as  we  do  SP(V)  and  we 
take  full  advantage  of  common  subexpressions  among  all  SP(A)  and 
SP(V).  Thus,  for  example, 

SP(V-(N2  U  N3))  =  SP(  {5} ) 

which  is  already  in  Table  1,  so  we  do  not  add  a  row  for  it.  Indeed, 
as  it  turns  out  in  this  case,  the  only  addition  to  Table  1  is  necessi¬ 
tated  by  SP(V-(N4  U  Ns))  =  SP(  { 1 }  )  .  This  expands  directly  in  terms 
of  SP(<{>)  and  hence  necessitates  the  addition  of  only  one  row,  Row  7, 
to  Table  1.  Note  that  the  topological  sort  and  evaluation  of  VALUE  is 
done  for  the  entire  table.  Thus,  for  each  iteration  on  the  G. ,  only 
one  pass  through  Table  1  need  be  made  to  obtain  all  necessary  SP(A). 


The  evaluation  of  F..  in  Equation  (3)  amounts  simply  to  dividing  the 
VALUE  in  the  row  corresponding  to  SP(V-(N.  \J  N.))  by  the  VALUE  in 
the  row  corresponding  to  SP(V).  The  appropriate  row  numbers  are, 
of  course,  recorded  in  a  table  once  during  the  procedure  which 
generates  Table  1 . 

The  actual  procedure  for  generating  Table  1  is  now  outlined.  A 
stack  of  sets,  A,  for  which  SP(A)  is  to  be  recursively  expanded  is 
maintained.  Initially,  the  stack  contains  V,  the  complete  node  set. 
It  is  convenient  to  maintain  such  sets  and  the  neighborhood  sets  N. 
as  bit  vectors.  At  each  stage,  the  set.  A,  at  the  top  of  the  stack  is 
popped  and  the  sets  A-i*  and  A-Nj*  are  formed,  where  i*  is  the  most 
desirable  node  in  A  to  expand  upon.  A  single  ordering  of  nodes  in  V 
is  produced  according  to  criteria  defined  below  and  is  used  to  deter¬ 
mine  i*  for  all  A.  Assume  therefore  that  the  nodes  in  the  network 
have  been  numbered  according  to  their  desirability;  i*  then  corre¬ 
sponds  to  the  leftmost  1  in  the  bit  vector  representing  A.  This  value 
should  be  recorded  along  with  the  set  A  so  that  it  can  be  used  as  the 
starting  point  for  the  search  for  the  best  node  in  the  sets  A-i*  and 
A-TT* .  This  value  in  fact  already  appears  in  the  GMULT  entry  in 
each  row. 

Initially,  Table  1  contains  only  one  row,  corresponding  to  V. 
The  SET  entry  in  row  1  contains  a  vector  of  N  l's.  As  a  set,  A,  is 
popped  from  the  top  of  the  stack,  its  TERM1  and  TERM2  entries  are 
set.  These  should  correspond  to  the  rows  in  Table  1  whose  SET 
entries  contain  A-i*  and  A-N.*.  A  search  of  the  SET  entries  cur- 

i* 

rently  in  Table  1  is  made  to  determine  if  these  sets  are  already 
present.  If  so,  the  row  containing  each  set  is  recorded  in  the  TERM! 
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and  TERM2  entries  of  the  current  row.  It  not,  a  row  is  added  to 
Table  1  for  each  new  set,  the  SET  and  GMU1.7  entries  are  tilled  in, 
and  the  new  sets  are  added  to  the  stack  of  sets  to  be  evaluated.  In 
practice,  the  row  number  in  Table  1  rather  than  the  set  itsell  is 

placed  in  the  stack  in  order  to  avoid  having  to  search  tor  it.  Also  a 
binary  search  tree  or  AVL.  tree  |ft|  structure  on  the  SI  T  values 
should  be  kept  to  facilitate  the  search  for  existing  sets.  An  alter¬ 
native  to  this  is  to  hash  these  entries  as  described  in  [6J. 

To  generate  entries  corresponding  to  SP(V-(N.  IJ  N.)),  these 
sets  are  treated  exactly  like  those  arising  above;  i.e.  a  search  is 

made  for  them  and  if  necessary  they  are  added  to  Table  1  and  to  the 
stack.  The  search  for  i*  in  such  sets  starts  at  node  1. 

We  thus  see  that  both  the  code  generation  procedure  and  evalua¬ 
tion  procedures  are  proportional  to  the  number  of  rows  in  Table  1  . 
There  are  other  factors  such  as  the  search  time  for  existing  sets,  the 
search  for  i*  in  each  row,  ard  the  generation  of  sets  V-{N. 

These  latter  factors  are  minor,  at  worst  polynomial  in  the  number  of 
nodes.  Our  primary  concern  is  then  with  the  number  of  rows  in 

Table  1,  which  can  theoretically  grow  exponentially  with  N,  the 

number  of  nodes. 

The  number  of  rows  in  Table  1  will  be  kept  down  if  sets  arise 

repeatedly  in  the  expansion  process.  This  is  more  likely  to  happen  if 

a  consistent  ordering  ot  the  nodes  is  used  in  expanding  all  sets. 

This  is  in  fact  done,  as  was  mentioned  above.  If  network  connec¬ 

tivity  is  related  to  distance,  a  node  ordering  based  on  location  will 
result  in  similar  subsets  being  generated  during  the  expansion  process. 
More  generally,  any  ordering  based  on  connectivity  is  likely  to  help. 


Alternatively,  an  ordering  based  upon  nodal  degree  may  be 
advantageous.  Expanding  on  nodes  with  large  degree  will  cause 
A-INF*  to  shrink  rapidly  to  <p  and  thereby  reduce  the  number  of  gen¬ 
erated  terms.  Conversely  if  node  i  has  no  neighbors  in  A,  then  A-i 
=  A-N.  and  only  one  subexpression  instead  of  two  is  created. 

Another  possibility  for  controlling  the  size  of  Table  1  is  to 
decompose  a  large  network  by  numbering  all  the  nodes  in  a  cut  (sep¬ 
arating  set)  first,  then  numbering  all  the  nodes  on  one  side  of  the 
cut,  and  finally  numbering  all  the  nodes  on  the  other  side,  as  in 
shown  in  Figure  4.  This  guarantees  that  once  expansion  of  the  nodes 
in  the  cut  (nodes  1  through  K)  is  done,  the  remaining  sum  of  products 
will  factor  since 

SP(AU  B)  =  SP(  A  )  *SP(  B )  (7) 

if  A  and  B  are  completely  disconnected  from  one  another.  In  par¬ 
ticular,  it  can  be  shown  that 

SP(  V)  =  ii  ((a  G  )  SP(  V-N, )  ]  (8) 

ICC  iel 

where  C  is  any  subset  of  V,  I  is  any  independent  subset  of  C,  and 
N  |  is  the  union  of  the  neighborhoods  of  nodes  in  I. 


Fig.  4  -  Decomposition  of  a  network  using  a  cut 


l.'i 


If  C.  is  a  cut,  dividing  V  into  disjoint  sots  A  .uni  B  with  no  links 

between  them,  as  in  Figure  4,  then  liquation  (8)  becomes 

SP(V)  =  2  1 1  n  tJ.)  Sl’(  A-N  )  SB(  B-N .  )  1  ( 9  > 

IC{:  ii  i  1  1  1 

Thus,  the  sum  of  products  "factors".  The  numbering  scheme  will 
ensure  that  the  nodes  in  B-!\l  are  iett  for  las!  and  hence  become  a 
common  subexpression  for  all  subsets  of  SP(A-Nj).  Thus,  the  factors 
in  (9)  will  be  evaluated  separately  and  multiplied  together  at  the  end. 
This  technique  can  be  applied  recursively  to  decompose  very  large 
networks.  It  appears  to  have  the  property  that  Table  1  will  grow  at 
worst  exponentially  in  the  size  of  the  cuts. 

We  have  coded  a  preliminary  version  of  this  procedure  in  order 
to  perform  experiments  to  determine  the  growth  rate  of  Table  1  as  a 
function  of  N  using  various  types  of  connectivity  and  node  orderings. 

Thus  far,  the  following  conclusions  and  observations  have  been  made: 

1  .  Random  X  and  Y  coordinates  were  generated  for  N  nodes  in 

a  unit  square.  Euclidean  distances  were  computed.  A 

threshold,  T,  was  varied  from  0  to  2.  \odes  at  distance  T 
or  less  were  said  to  be  connected.  Experiments  for  N=1  r> 
and  N-dO  were  run.  Nodes  were  ordered  based  on  the  sum 
of  their  X  and  Y  coordinates.  Bor  N  la,  '1  able  1  typically 

had  about  80  rows,  lor  S R( V )  only.  Tor  N-dO,  Table  1 

typically  had  about  120  rows.  The  conclusion  we  draw  on 

the  basis  of  this  limited  data  is  that  Table  I  does  not 
appear  to  be  growing  exponentially.  Indeed,  it  seems  to  tie 
growing  only  slightly  faster  than  linearly. 


2.  For  random  connectivity  and  node  ordering  based  on  nodal 
degree  (largest  first  )  roughly  80  terms  were  generated  for 
N=15  and  roughly  600  terms  for  N=30.  These  results, 
based  on  few  runs,  are  inconclusive. 

3.  Crid  networks  (nodes  evenly  spaced  over  a  unit  square)  of 
25  nodes  and  varying  connectivity  were  examined  using 
distance  based  connectivity  as  in  1  above.  Table  size 
varied  with  connectivity,  with  the  worst  case  being  120 
terms  at  a  nodal  degree  around  b.  Again,  the  conclusion  is 
that  table  size  does  not  seem  to  be  growing  very  rapidly. 

We  are  currently  implementing  a  more  complete  version  ol  the 
procedure  in  order  to  do  more  extensive  testing. 
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1.  INTRODUCTION 


Packet  radio  has  recently  emerged  as  a  viable  technology  for  both  fixed  and  mobile 
computer  communications.  It  utilizes  packet-switched  communications,  which  became 
highly  important  in  computer  communication  networks,  over  broadcast  radio  channels. 

The  original  packet  switching  concept  was  based  on  point-to-point  data  circuits 
interconnecting  network  packet  switches.  With  technology  advances  in  satellite  and  ground 
radio  broadcasting,  packet  radio  networks  have  become  an  area  of  major  interest  [6]  . 
During  the  early  1970’s,  the  ALOHA  project  at  the  University  of  Hawaii  demonstrated  the 
feasibility  of  using  packet  broadcasting  for  bursty  computer  traffic  [1] . 

In  a  broadcast  radio  network,  channels  are  shared  by  a  set  of  nodes  and  cannot  be 
dedicated  to  specific  pairs.  Numerous  studies  ( [l] ,  [6] ,  [7] ,  [5]  ,  and  [9]  )  have  shown 
that  "fixed  capacity  assignments"  are  wasteful  for  many  applications  compared  to  "random 
access  schemes"  which  result  in  a  dynamic  sharing  of  channel  capacity  without  centralized 
control.  These  studies  focused  on  a  single-hop  network,  whereby  Packet  Radio  Units  (PRU's) 
attempt  to  transmit  data  to  a  single  central  station.  In  conjunction  with  the  fact  that  there 
is  no  control  over  access  to  the  channel,  destructive  errors  result  when  several  packets  are 
received  simultaneously  or  "collide."  These  collisions  reduce  the  channel  throughput  under 
random  access.  v 

The  analysis  of  multihop  packet  radio  networks  involving  packet  routing  via  relays,  is 
extremely  difficult.  So  one  must  use  simulation  [9]  ,  [10]  or  study  simple  models  to 
measure  network  performance,  mainly  the  network  capacity  or  network  throughput  [8]  , 
[14],  and  [15].  The  throughput  of  muitihop  slotted  ALOHA  network  was  studied  recently, 
[12] ,  and  it  was  found  that  one  of  the  most  important  factors  affecting  it,  is  the 
transmission  radius  (or  transmission  power)  used  by  the  nodes.  To  increase  the  throughput, 
one  may  even  want  to  reduce  the  transmission  power  to  a  level  lower  than  the  maximum 
possible. 

Techniques  which  are  used  to  increase  the  network  throughput  include  Carrier  Sensing 
Multiple  Access  (CSMA)  [13] ,  and  resolution  of  collisions  via  packet  capture  mechanisms. 
The  effect  of  capture  in  increasing  the  network  throughput  was  studied  in  [2]  and  [11]  for 
single-hop  networks.  In  [4]  the  effect  of  capture  on  the  throughput  was  investigated  for 
multiple  transmitters  and  receivers  in  a  slotted  ALOHA  environment. 

The  problem  of  obtaining  exact  expressions  for  throughput  and  blocking  probabilities  in 
multihop  packet  radio  networks  using  CSMA  access  method  was  considered  in  [3]  .  Exact 
results  were  obtained  assuming  exponential  packet  lengths  for  general  topologies.  This 
analysis  has  been  extended  to  constant  packet  length  for  selected  simple  topologies. 


In  this  work,  we  consider  multihop  packet  radio  networks  using  slotted  j*nd  unslotted 
ALOHA  access  methods  with  and  without  capture.  We  consider  a  set  of  ident.cai  PRU's 
randomly  (uniformly)  distributed  in  the  plane.  We  find,  via  analytic  techniques  and 
computer  experimentation,  that  the  optimal  density  of  radio  units  which  maximizes  the 
throughput  of  the  network  is  the  same  for  all  access  methods  considered  and  corresponds  to 
the  previous  found  value  for  the  slotted  ALOHA  case  without  capture  [12]  . 


1.2 


2.  SLOTTED  CASE  MODEL 

The  nodes  of  the  network  are  assumed  to  be  uniformly  distributed  with  density  X. 

Two  access  methods  are  considered: 

-  No  capture  (slotted  ALOHA). 

Capture  (closest  neighbor). 

We  consider  a  multihop  packet  radio  network,  with  transceivers  uniformly  distributed  in  a 
large  geographical  area.  Packet  Radio  Units  (PRU's)  have  equal  transmission  range.  Any 
node  within  the  circle  of  radius  r  around  a  transmitting  node  will  hear  the  transmission  and 
can  respond  to  it.  Each  node  always  has  a  message  to  send  and  will  do  so  whenever 
permitted,  (see  Figure  1). 


Range  of  reception  for  terminal  i. 


Number  of  terminals  in  the  area  A; 


Probability  of  a  packet  transmission  during  a  slot  by  a  terminal. 


Probability  of  successful  reception  of  a  packet  from  a  transmitting 
terminal  of  the  area  A:. 


qj  =  Pr  {  a  neighbor  transmits  successfully  to  i/Aj  |  =  P1  .  P2  .  *V 
where  Pj  -  probability  that  i  does  not  transmit. 


Pj  =  1  -  G 


Probability  that  a  neighbor  from  area  A.  transmits  to  i  and  i  can  receive 
the  packet. 


2.1 


/Ni\ 

P«  3  l  I  G  (1  -  G)  -  no  capture  (one  transmission) 

w 

N. 

l 

Pj  *  1  -  (1  -  G)  -  capture  (one  or  more  attempts  made) 

Pj  -  Probability  that  captured  packet  is  destined  to  i,  while  source  has  Nj 
options  to  address. 


Therefore, 

a.  No  capture  (slotted  ALOHA): 

Ni 

qj  =  G(1  -  G) 

b.  Capture: 

V  ^  t""-"'] 

Assuming  that  the  number  of  terminals  in  an  area  A  is  a  Poisson  random  variable  with 
density  X,  we  have  that  S  (the  throughput  to  a  node)  is  given  by: 

-  (XA)1  e~XA 

S  =  I  - 

i  3  1  i! 

XA  =  N  -  Average  degree. 

Therefore, 


a.  No  capture: 


nae 


S*Ge'NG  -Ge'N 


b.  Capture: 

S  «  (1  -  G)  e"N  £  — 1-  fi  -  (l  -  G)1 1 

i=l  i!  i  I  J 


N. 


- 


The  maximum  throughput,  will  be  achieved  with  G  =  G^,  by  setting: 
dS 

as*0 

The  total  network  throughput  Snfit,  will  be: 

W  ‘  s 

Where  n  is  the  total  number  of  nodes.  The  average  number  of  hops  in  an  end-to-end  path  h 
is  given  in  [12]  : 


r  _  128 

h  '  45* 


(*)' 


_K  \  -N  fco.^O.tvCTl 
1  +  e  •  Je  7  L  Jdt 


-1 


•I 


Therefore,  the  end-to-end  throughput  Y  is: 
S. 


net 


To  find  G 


opt' 

a.  No  capture:  it  is  given  in  [12] . 


b.  Capture: 


S*(l-G)e‘N  Z  —  fl-d-G)1]  ; 

i=l  i!  i  L  J 


2.4 


3. 


UNSLOTTED  CASE  MODEL 


1TELC 


For  the  unslotted  case,  the  following  schemes  are  considered: 

-  Pure  ALOHA  (All  PRU’s  transmit  at  random.  Collided  receptions  result  into  loss 
of  all  colliding  packets). 

-  Perfect  Capture  (In  case  of  collision,  the  receiver  keeps  tracking  the  packet  it 

received  first.  This  is  possible  under  a  broad  range  of  spread  spectrum 
technologies).  #< 


-  Half-Amplitude  Capture  (In  case  of  collision,  the  receiver  will  listen  to  the 
closest  transmitter). 

As  in  the  unslotted  case,  we  assume  that  PRU’s  are  randomly  located  in  the  network  area. 
The  rate  of  new  packets  and  total  rate  of  packets  (including  retransmissions)  between  PRU's 
separated  by  a  distance  d,  are  given  by  Sd  and  Gd  respectively. 

The  following  areas  are  defined  (see  Figure  2): 

A^  -  Inside  the  contour  D-E-F-C-H-D. 

Aj  ■  Inside  the  contour  D-K-C-F-G-E-D. 

A  ^  ~  Inside  the  contour  D-J-F-G-E. 

A^  -  Inside  the  contour  E-G-F-A-E. 

Ag  -  Inside  the  contour  D-E-A-F-C-H-D. 

These  areas  can  be  expressed  as  functions  of  communication  radius  (r)  and  variable  angle  0 

A1  =  itr2 

2  r2 

A2  3  nr  ~  T~  (2  91  “  sin  2  Q\)  ’  2 


3.1 


•i 


• \ 


*4 


•4 


i 


nac_ 


0,  if  d  <  r/2  or  9^  >  cos-1  ^ 

d2  r2 

~  (2  Qj  -  sin  2  <3j)  -  y-(2  0  2  -  sin  2  02)  = 


1 2  cos2  g .  ("4  cos  1  / — ..-1 ...  _\1  -  sin  [4  cos”1  ( — - _ \ 

1  L  \  4  cos  0  ^/J  L  \4  cos  /  . 

os-1  ^ -  *  l  sin  [2  cos'1  f-J: - \  1 1 ; 

4  cose1  L  \4  eosGiy  J  f 


Since, 


d 

9. 


2r  cos  0^; 

cos'1  ~  ;  and  0  j  *  2  02; 


/  9  2  2  -It 

wd  *  4  itr  cos  0  ^  ,  if  ©1  >  cos  ± 


ird2-A3  =  irr2 


itr2  -  A, 


’  2  a  1  A3  " 

4  cos  9 1  ’  T  T  ; 

L  r  -» 


itr 


£l  *  4  cos2  0  J  ;  if  ©  ^  >  cos”1  | 
nr2  [l-4cos2et*  -J-  ^f]  , 


Therefore,  the  joined  probability  of  the  number  of  nodes  in  the  areas  above  is: 


(AA.)  1  (AA-)  2 

KnvnJ  =  - - - - -  -A(A.  +  A_) 

nl*  n2! 


(AA.)  1  (AAJ  J 

P(n.,nJ  =  - i - - -  -A(A+AJ 

13  n  « n  •  e  1  3 

nl’  n3' 


3.3 


'j 


*4 


■•4 


*4 


-*4 


nac 


P(n3,  n4,  n5) 


(VA.)  3  ( A  A  ,)  4  (AA.)  5 
3  4  o 


n3!  a4!  n5! 


ri(A3  *  ■'j  *  a5> 


n .  n. 

(XA.)  4  (\A,)  3  ua  +  a  \ 

P(n.,ns)  =  - 4 - 3 _  e'X(A4"A5) 

V  "s’ 


Where  =  population  of  itn  area. 

Using  the  average  degree  definition: 

N  =  An  r2 


we  obtain: 


P(nlf  n2)  = 


M("l  *  n2> 

nl!  n2! 


(,-!i 


sin  2  Q . 


H 


sin  2  9  j  \  i 


N("l  *  "s’  /A3\  ‘  ,  f  A,\i 

P(nl’n3>  =  ~  "  t  T 3  )  exp  J  -N  ~  -Jjjj; 


n4,  n5)  - 


V  n3!  n 


N(n3  *  n4  *  "S> 


n,!  n  '  n  ' 


3’  4'  5' 


1-4  cos2  0,  — 

1  n 


nl  /  \ 

i  /A,\  f  2  1  /  A3\  1 

X  (  J  )  Ucos2  9  -  x  1-4) 

*  V  r V  L  X  r-y  J 

1?)  exp  { -N  *  X  £)}, 
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P(n4,  n5) 


n5) 


V 


n5' 


COS  J 


1  -  4  cos"  0 , 


In  order  to  compute  the  throughput  (successful  transmission  rate  S^)  between  PRU  1  to 
PRU  2  separated  by  a  distance  d,  the  following  observation  is  made:  For  PRU  2  to  receive 
correctly  a  packet  (of  duration  normalized  to  1),  the  packet  should  be  transmitted  from: 

Area  (A ^  UA2)  during  jt  -  1,  t  +  lj  for  pure  ALOHA,  (t  is  the  time  of  packet 
transmission). 

AreafAj^  UA2)  during  jt  -  1,  t j  for  perfect  capture. 

-  Area  (A,  UA  .  UA. 

3  4a 

The  probability  of  successful  transmission  Sd/Gd,  will  be  given  as  follows: 


)  during  jt  -  1,  t  +  1  j  for  half  amplitude  capture. 


Pure  ALOHA: 


Sd 


Gd 


L 

>2 

>° 


.  -2G  (n,  +  n«) 

i  e  *„in2> 


Perfect  Capture: 


£  _1_  e”G^nl  +  n2} 

nj  >  2  nl 
n2  1° 


P(n^,  n2) 


Half  Amplitude  Capture: 


-2G  (n. 


n3) 


P(n3>  n4,  n,.); 


if  A  j  exists. 


£ 


n4  >  2 
nS>° 


1 


n4  +  n5 


-2G  n. 

®  lij); 


if  a3  =  0. 


Using  the  expression  for  probabilities,  transformation  d  — •  0  ^  (see  below),  and 
unconditioning  with  respect  to  0 


Pure  Aloha: 


S  = 


E  Ge 


-2G(n1+n2)  N 


(nj+nj)  ff/2 


nj  >  2 


nl'n2'nl 


/  2  sinQj  (  1 

it/3 


20x  sin  20j 


n2>° 


exp 


d0. 


Perfect  CaDture: 


S  = 


-Gfn.+n,)  N 
E  G  1  2 


(nj+n2)  rt/2 


*  *  *  C  f  20,  sin  2  0,  \ 

I  ,n  J  2  sin  0 .  1-— i  ♦  - - 

n,.n0.n.  J  1  \  tt  «  / 


nac 


Half-Amplitude  Capture: 


Ge 


■2G(n3+n4)  N 


n4  >  2 

n3»n5  £  0 


n3!n4!n5!  (Vn5} 


cos"1  7  =  1.32 


/  **9i(±  t) 


'  1 

4  cos"  0 , 


-f  =1.043 


i_  A;1 

*  r2 


2  1^3 

1  -  4  003^  ♦  ~  — - 


exp  J-N(l>i  |  <*Ql  * 


£  Ge 


<nA+n!;^ 

■2Gn4  N  4  5 


ff/2=1.57l 


n4  >  2 

n5>0 


B4!  nS!  (Vn5>  cos'1  1  =  1.32 


(\  4 
4  cos2  0  j  J 


2  3  -N 

(1-4  005^0^  e  do^* 


To  find  G^,  we  set: 


This  equation  is  solved  numerically  using  Newton's  method: 


G^+1^  = 


f(G(i)) 
f  (G(i)) 


for  given  error  value  allowed. 

Using  fi ,  [12] ,  we  obtain  the  throughput: 


Y  =  -2SL  ;  where  Sngt  =  n  •  S 

R 

Tranformation  d  —-*9^ 

Since  d  is  uniformly  distributed  (see  Figure  3)  its  distribution  will  be 


4. 


NUMERICAL  RESULTS 


The  analyses  described  above  were  implemented  in  computer  programs  in  order  to 
compute  the  maximum  throughput  for  various  average  degree  values.  These  programs  for 
the  unslotted  cases  involve  considerable  computational  complexities  and  excessive  run 
times.  We  have  been,  however,  able  to  obtain  the  basic  results  which  are  portrayed  in 
Figure  4. 

Y 

In  Figure  4  we  show  the  normalized  throughput  —  as  function  of  the  average  degree 

\/n 

N.  For  slotted  capture  ease,  the  value  of  N,  which  maximizes  the  throughput,  is  9  or  10,  at 
which  point  the  optimal  network  throughput  is  0.19  Vn  .  The  maximizing  value  of  N  for  the 
four  other  cases  is  approximately  6,  which  is  the  same  result  as  in  [12] . 

Note  that  the  significance  of  the  optimal  degree  can  be  interpreted  as  what  should  be 
the  density  of  PRU's,  in  a  deployment  scenario  in  order  to  achieve  maximum  throughput. 


FIGURE  4 


5.  FURTHER  RESEARCH 


The  approximate  nature  of  the  models  studied  was  justified  from  the  need  to  obtain  a 
tractable  solution  to  the  optimal  density  of  radio  units  in  a  deployment  scenario.  The 
following  extensions  can  be  performed  using  similar  assumptions  on  the  random  topology  and 
disregarding  boundary  effects: 

i 

-  Model  Carrier  Sensing  Multiple  Access  (CSMA)  with  and  without  capture.  Ho 
exact  models  for  CSMA  have  been  so  far  obtained,  due  to  the  non-Marcovian 
nature  of  the  resulting  packet  transmission  process  (see  [3] ).  Approximate 
methods  can  be  devised  or  additional  assumptions  on  exponential  packet  lengths 
can  force  a  Markovian  birth-death  approach  as  in  [3] . 

-  Model  flood  routing,  whereby  every  PRU  relays  all  received  packets,  regardless 
of  their  ultimate  destination. 

Further  research  should  address  the  impact  of  specific  topology  and  routing  instead  of 
the  random  dispersed  PRU’s  over  an  infinite  area.  More  specifically,  in  [3] ,  Boorstvn  and 
Kershenbaum  proposed  a  methodology  for  modeling  CSMA  packet  radio  networks  with  or 
without  capture,  under  the  assumption  of  exponential  packet  length.  The  generalization  of 
the  method  to  fixed  or  general  packet  length  distribution  is  still  an  open  issue. 

Other  areas  of  interest  are: 

1.  Flow  control  techniques  to  achieve  maximum  throughput. 

2.  Feasibility  of  using  packet  radio  networks  for  packetized  voice. 
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Abstract—  *\  adaplivr  rmilini*  vhrnu  fur  piakil  sMilihrd 

lompulcr  oimmunicylinn  nrUurks  is  proposed  ^nd  in*  est  i^alrd  1  hf 
first  Irsrt  is  quasi-static  and  bastd  on  (hr  glnhiil  nriwork  status.  Thr 
second  U*s f I  is  dsnamic  with  decisions  heme  m^dr  at  each  node  in  an 
attempt  to  obtain  the  savings  in  asira^e  dda>  predicted  bv  a 
mulliservcr  model  of  the  node.  Simulations  confirm  the  predicted 
improvement. 

I.  INTRODUCTION 

MOST  adaptive  routine  schemes  perforin  about  as  well  as 
nonadaptive  schemes  when  evaluated  in  a  fixed  enviion- 
ment  |lj.  However,  they  do  adapt  to  changes  in  network 
topology  and  input  statistics  The  reason  they  do  not  show  a 
significant  improvement  is  that  they  are  actually  quasi-static 
in  that  they  sense  and  respond  to  the  above  changes  slowly. 
Their  goal  is  to  select  good  paths  [2] . 

We  focus  our  attention  on  the  node,  which  may  be  viewed 
as  a  multiple  server  queueing  system.  Most  adaptive  routing 
schemes  operate  the  node  as  a  collection  of  single  server 
queues.  If  the  node  is  operated  as  a  multiple  server  queue,  an 
advantage  at  the  node  of  a  factor  approximately  equal  to  the 
number  of  servers  (outgoing  branches)  can  be  obtained.  How¬ 
ever.  the  control  over  good  paths  may  be  lost.  We  show  that 
we  can  simultaneously  obtain  both  good  paths  and  improved 
node  performance.  These  improvements  in  time  delay  exist 
for  the  network  as  well.  Consequently,  network  bandwidth 
can  be  significantly  reduced.  Furthermore,  the  approach  lends 
itself  to  analysis.  We  also  discuss  limits  and  extensions. 

Many  computer  communication  networks  (ARPANET. 
TELENET.  TYMNET,  etc.)  use  dynamic  routing  schemes  lo 
compensate  for  input  traffic  variations,  to  respond  to  changes 
in  topology,  and  to  take  advantage  of  temporary  changes  in 
loading  in  different  paths.  During  the  design  of  a  network, 
they  are  replaced  by  analytically  tractaole  nondynamic  (static) 
schemes.  We  present  here  a  routing  scheme  for  wluch  we  have 
been  able  to  derive  approximate  analytical  models.  Further¬ 
more.  we  can  establish  the  efficiency  of  this  scheme,  especial!, 
in  heavily  loaded  situations. 

A  typical  static  routing  scheme  would  operate  as  follows. 
Consider  as  separate  commodities  the  message  originating  at  a 
particular  node  anu  destined  lot  a  second  node  in  the  network 
The  static  routing  scheme  would  specify  the  optimum  propor- 
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tioi.  of  traffic  to  he  routed  over  each  path  idlicieil!  algorithms 
exist  tor  design  ol  this  type  of  routing  [3]  . 

We  can  identity  one  particular  problem  with  this  approach 
Although  good  paths  are  indeed  found,  any  node  essentially 
operates  as  a  collection  ol  single  server  queues-  one  queue  for 
each  outgoing  branch.  Considering  the  node  as  a  queue  with 
several  potential  servers,  this  is  not  an  efficient  manner  of  op¬ 
eration.  Indeed,  if  a  node  had  k  outgoing  branches  and  were 
operated  as  a  queue  with  k  servers,  then  the  time  delay  would 
be  reduced  by  a  lactor  of  approximately  k.  Conversely,  the 
throughput  can  be  increased. 

If  the  node  were  operated  as  suggested  above,  messages 
would  wander  aimlessly  through  the  network  and  the  total 
performance  would  be  abysmal.  Our  approach  is  to  retain  the 
good  paths  for  commodities  and  yet  still  get  the  benefit  of  the 
faster  performance  at  the  node. 

Briefly  our  scheme  is  as  follows  [4] .  Consider  a  node  as  a 
single  queue  with  several  servers  (output  channels).  For  a 
particular  commodity,  i.e.,  a  message  with  a  certain  destina¬ 
tion,  the  use  of  some  of  these  servers  would  cause  the  mes 
sages  to  be  sent  along  “bad"  patlis-eitiier  too  long  or  too 
congested.  Thus,  for  each  commodity  and  at  each  node  we 
specify  a  subset  of  the  output  channels  as  allowable  and 
permit  the  message  to  use  any  allowable  channel  according  to 
some  discipline.  F.ach  commodity  appearing  at  the  node  has 
its  own  allowable  set  of  channels  There  restrictions  force 
messages  to  use  “good”  paths.  It  has  already  been  noted  [5j 
that  an  advantage  can  be  obtained  if  a  selection  between 
relatively  good  paths  is  not  forced,  and  the  adaptive  routing 
scheme  continues  to  allow  each  of  these  to  be  used. 

The  assignment  of  allowable  branches  at  each,  node  for  each 
commodity  is  one  level  of  our  adaptive  touting  scheme  These 
assigaments  are  based  on  essentially  global  information  of 
topology,  flows,  and  long  term  averages,  and  may  be  adaptive 
in  a  quasi-static  way.  The  second,  and  truly  dynamic,  level  of 
our  routing  strategy  is  local  and  involves  the  queue  discipline 
at  each  node.  As  an  example,  consider  a  node  with  two  out¬ 
going  channels  (servers).  All  message  commodities  fall  into 
three  classes.  Two  of  these  must  use  only  one  of  the  servers 
and  have  no  choice.  Messages  |oin  the  appropriate  queue  and 
are  served  tn  turn  The  third  class  of  messages  may  use  eithet 
of  the  two  servers  and  join  a  third  queue 

Another  strategy  at  a  node  is  to  give  priority  to  the  rues 
sages  that  have  no  choice,  i  e..  arc  de, heated  to  one  of  :he 
servers  Yet  another  strategy  is  lo  allow  message'  in  the  thud 
category  ( r.ondcdie  jlcd )  lo  join  the  shot  ter  ol  me  two  dedi 
cated  queues  We  have  evaluated  the  per  lot  nance  of  bo’h 
sli.iiecies.  jud  other  similar  su.m  eies  tot  this  simple  two 
sever  mo  lei.  lo;  . re  lOinpi.  s  n.n  e  tv.iei.,  and  lor  simple 
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networks  The  measure  ot  performance  was  the  average  lime 
delay  for  all  messages  traversing  the  node 

II  NLTWORk  MODI  1 

We  consider  a  networis.  actually  the  backbone  of  a  distrib 
uted  (store-and-forward )  packet  switched  data  communica¬ 
tions  network,  as  a  collection  of  nodes  connected  by  lull 
duplex  links-comprising  the  topology  The  input  traffic  is 
described  by  a  matrix  of  rates  (packets/second)  of  traffic 
originating  at  one  node  and  destined  for  another  Index  die 
nodes  by  1,  ••,  A'  and  denote  the  links  between  nodes  /  and  / 
by  (/,/).  Alternately,  index  the  links  by  1,  .  /.  (We  assume 

N  nodes  and  L  links.)  Denote  the  input  traffic  (from  source  t 
to  destination  j)  by  yt/  (packets/second)  Assume  (for  simplic¬ 
ity)  that  the  average  length  of  all  packets  is  !/p  bits  Assume 
further  that  all  input  streams,  for  each  of  the  ,V(Ar  —  1)  node 
pairs,  are  independent  and  Poisson.  Let  the  link  capacities  be 
given  by  C,. 

A  nonadaptive  routing  is  given  by  a  set  of  paths  for  each 
commodity  (node-pair  traffic)  and  the  proportion  of  input 
traffic  using  each  path.  From  this,  the  flow  in  each  link  in 
each  direction,  X/,-  .)  =  \t  (packets/second),  can  be  found. 
Note  that  y  =  S,  is  the  network  throughput,  or  total 
offered  load,  and  X  =  I,  /X(,  /)  is  the  total  internal  traffic. 
Furthermore,  note  that  /  =  \Jy  >  1  is  the  average  path  length 
in  the  network 

We  assume  exponential  packet  lengths  We  further  make 
the  "independence”  assumption  [6 1  that  at  each  node  in  a 
path,  a  new  random  assignment  of  the  packet  length,  with  the 
same  distribution,  is  made.  This  idealization  ieads  to  the  prop¬ 
erty  that  each  node  can  be  modeled  independently  as  simple 
M/M/l  or  M/M/k  queues.  It  has  been  shown  that  this  approxi¬ 
mation  yields  valid  results  for  many  network  situations  [6] . 
Some  care  must  be  taken  in  situations,  such  as  adaptive 
routing,  where  dependence  between  nodes  is  a  key  factor.  We 
will  ignore  this  warning  until  the  end,  when  we  comment  upon 
its  implications,  and  assume  the  above  type  of  independence  at 
each  node. 

Many  streams  of  traffic  converge  at  a  node-from  incoming 
network  links  and  from  outside  the  network  Some  of  this 
traffic  is  destined  for  this  node,  leaves  the  network,  and  will 
not  concern  us  at  this  node  The  remaining  traffic  musr  be 
routed  to  an  appropriate  outgoing  link.  A  nonadaptive  strategy 
decides  what  proportion  of  such  traffic  uses  each  link  on  a 
commodity-by-commodity  basis.  Thus,  each  link  can  be 
modeled  as  an  M /Ml  1  queue  and  has  link  average  time  delay 
T,  =  l/(fiC,  —  X,).  The  average  time  delay  in  the  network  is 
T  =  (S,X,7V)/7  If  we  define  rhnk  -  (X,X;r,)/A  as  the  time 
delay  of  a  typical  link,  then  T  -  /7)lnk 

III.  NODF  MODLL 

A  node  in  a  network  has  several  outgoing  links,  say  k\  it 
is  said  to  be  of  degree  k  In  nonadaptive  routing,  the  node 
operates  as  a  collection  ol  single  server  queues  which  we 
can  model  as  M  Vf/|  queues  Ihus  the  time  lelav  for  link 
/  is  T i  =  1  H^Ci  -  \|)  Hie  time  delay  tor  node  n  is  7'(„,  ~ 
Z,\n jT(„  ,,/X(fl)  where  A,*,  =  L,A„  ,  is  the  rate  of  traffic 


a  '  > 

that  node  n  sends  onto  me  network  The  network  time  delav 
is  T  —  L„\„)7 ,r. ,  y  v  hiv!:  fives  tl;<-  same  rewi!'  as  above  but 
expressed  in  loom  ol  ium.iI  Jclav 

Ignoring  routing  to*  j  moi  .eni.  m<-  tio.ic  ^jii  be  viewcU  as  a 
multiple  server  queue  ( with  k  -.eivcr.l  I' »c  a,,,u;.f,i,;  i 
at  this  node,  and  i!  all  packets  join  one  laiae  ,j.,eue  ami  j;e 
served  by  any  available  server .  then  we  have  an  M  M,\  .j«.r  . 
with  average  waning  time  at  node  n  gnen  by 


J_  f_b  J_ 
pC  k  I  —  p 


(I  l 


where  p  =  and  PB  is  the  probability  that  all  servers 

are  busy.  Although  PB  can  be  enumerated  given  p  and  k.  the 
only  property  we  need  here  is  that  PB  -*  1  as  p  -  1 . 

Contrast  the  above  result  to  that  for  a  simple  server  queue 
where,  with  p  -  X;/pC,  the  waiting  time  at  Link  /  is 


(2) 


We  see  that  the  multiple  server  operation  has  a  waiting  time 
advantage  of  at  least  k.  The  time  delay  is  the  waiting  time  plus 
the  service  time  (1/pC).  Asp  -*  1,  the  waiting  time  dominates. 
Pb  1,  die  multiple  server  operation  has  an  advantage  in 
time  delay  of  a  factor  of  k.  Although  we  concentrate  ut  tins 
paper  on  packet  time  delay  as  a  performance  measure,  many 
other  network  situations  foes  on  waiting  time  In  that  case, 
greater  reductions  in  delay  mav  be  achieved  since  PB  is  of  trie 
order  of  pk . 

In  making  the  above  comparison,  we  assume  that  the  link 
and  node  utilizations  are  not  mcrexsed  when  adaptive  routing 
is  used  to  obtain  multiple  server  performance  at  the  nodes  As¬ 
sume  first  that  only  minimum  hop  paths  are  used  for  the  non 
adaptive  routing,  resulting  in  single  server  queues  at  the  links. 
Allow  the  adaptive  routing  to  choose  between  all  minimum 
hop  paths  for  each  commodity  Tins  will  produce  the  same 
utilization  in  both  cases.  If  other  paths  are  used  in  the  non- 
adaptive  routing,  similar,  but  less  precise,  arguments  can  he 
made.  It  will  be  seen  in  the  next  sections  that  the  adaptive 
routing  performs  better  when  more  of  the  traffic  has  choice. 
Since  there  is  a  significant  improvement  in  time  delay,  when 
the  utilization  is  moderate  it  is  possible  to  trade  off  a  higher 
utilization  by  using  longer  paths  with  the  resulting  greater 
choice  In  the  examples  discussed  below,  we  have  not  found 
this  to  be  necessary.  In  some  cases  where  minimum  hop  paths 
are  not  used  in  the  optimum  nonadaptive  routing,  the  adaptive 
routing  could  still  provide  a  choic*  of  similar  length  paths. 
Then  the  utilization  will  not  be  increased. 

These  observations  are  summarized  in  figs  1  and  2.  In  f  ig 
1  we  compare  waiting  times  for  queues  with  two  servers  and  a 
single  stream  of  input  traffic.  Multiple  server  operation  is  best 
In  the  othet  thice  cases,  a  queue  is  provided  lor  each  seiver 
and  'he  input  stream  is  divided  tr,i->  'wo  in  ditleient  wavs  In 
order  of  decreasing  pei tonnance  an  arrival  loins  the  .shottei 
queue,  arrivals  alternate  between  the  two  queues,  and  arrivals 
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Fif.  1*  Wtitifif  time  in  i  node  with  two  terms  for  different  routlnj 
decision  rule*. 


UTILIZATION,  t 

Fif.  2.  W si  tint  time  In  ■  muitipU-term  node. 

randomly  select  a  queue.  The  Utter  b  equivalent  to  two  single 
server  queues.  In  Fig.  2  we  contrast  multiple  server  and  single 
server  performance  in  a  node. 

Our  adaptive  routing  procedure  win  attempt  to  obtain  these 
performance  advantages. 

IV.  ADAPTIVE  ROUTING  PROCEDURE 

First  consider  routing  all  packets  at  a  node  by  allowing 
them  to  use  any  outgoing  link.  The  node  can  now  be  modeled 
as  a  multiple  server  queue  and  achieves  the  already  described 
performance  advantage  over  single  server  operation.  Path 
lengths,  however,  would  become  extremely  long  because  of 
this  random  routing,  the  total  traffic  in  the  network  would  in¬ 
crease  catastrophically,  and  total  network  performance  would 
be  destroyed.  We  will  show  how  to  achieve  the  time  delay 
advantage  without  this  deterioration  in  total  network  perform¬ 
ance. 

We  propose  a  two-level  adaptive  routing  procedure.  The 
first  level  is  global,  and  while  sliU  adaptive,  is  expected  to  be 
slowly  varying,  responding  to  average  statistics  of  congestion 
and  traffic  and  alarms  due  to  link  failure,  onset  of  congestion, 
new  traffic,  etc.  We  assume  that  some  mechanism  exists  for 
making  adjustments,  and  that  these  will  be  made  relatively 
infrequently  compared  to  the  rate  of  second  level  adaptivity. 
We  will  assume  in  this  paper  that  the  first  level  of  adaptation 
does  not  change  over  the  period  of  interest. 

The  first  level  consists  of  assigning  a  set  of  "good"  paths  to 
each  commodity  or  node  pair  that  wish  to  coniir.iinican-  In 
stead  of  finding  the  "best"  path,  we  look  for  as  large  a  set  as 
possible  of  paths  that  are  good  in  some  sense  a  small  number 


of  hops.  smali  .  >  ,.'s;mn  cum  A  similar  approach  was  pre¬ 
sented  m  (‘'I  i  v  e  !o  not  force  ourselves  at  this  level  to 
choose  among  alternative  paths  that  are  of  the  same  or  similar 
quality .  That  done  m  the  second  level.  Die  second  level  per¬ 
forins  better  if  there  are  more  alternative  paths.  There  is  an 
inherent  tradeoff  here,  since  choosing  longer  paths  places 
more  traffic  on  the  network.  We  assume  thst  a  selection  of 
paths  is  possible  and,  although  we  do  not  investigate  it  here, 
this  it  can  be  made  slowly  adaptive. 

As  s  result  c-f  the  first  level  procedure,  there  will  be  nodes 
where  the  paths  for  a  particular  commodity  intersect.  Including 
the  originating  node.  Packets  at  such  a  node  have  a  choice  of 
which  outgoing  link  they  can  use  (two  or  more  links  will  be 
in  the  vim  .  aiiowable  paths).  There  are  certain  links  which 
may  not  be  us -a  by  a  particular  commodity-they  lead  to  bad 
paths.  We  do  not  force  the  choice  on  the  packet  upon  arrival 
at  the  node,  but  operate  the  node  as  a  (constrained)  multiple 
server  facility.  If  ail  packets  could  chooae  any  of  the  links, 
then  it  would  indeed  be  a  muliuerver  facility  and  we  would 
get  the  k-fold  improvement  in  delay.  We  will  show  that  most 
of  this  improvement  is  still  obtained,  even  when  some  servers 
cannot  service  some  commodities. 

This  two-ievel  adaptive  procedure  will  then  give  us  the  de¬ 
sired  improvement  in  time  delay  while  maintaining  good  paths. 

V  ADAPTIVE  NODE  MODEL 

First,  as  shown  in  Fig.  3,  consider  a  node  with  two  outgoing 
links  (a  and  b ).  Some  commodities  have  assigned  paths  that 
only  use  one  of  these  links.  They  have  no  choice  and  will  be 
routed  directly  to  a  dedicated  queue  which  can  only  be  served 
by  one  of  the  servers.  Denote  the  total  rate  of  traffic  joining 
each  of  the  queues  as  X*  and  \b.  Other  commodities  (of  total 
rate  >*.)  have  paths  which  use  either  link  a  or  b.  We  assign 
them  to  a  tht'd  queue  (c).  Server  a,  for  example,  may  choose 
to  serve  a  packet  from  a  or  c,  but  not  b.  We  will  discuss  dif¬ 
ferent  strategies  for  the  server. 

The  only  way  this  constrained  multiple  server  queueing 
system  differ;  from  M/M( 2  behavior  Js  when  queues  a  and  e 
are  empty,  for  example,  while  queue  b  is  not.  One  of  the 
servers  (a)  is  idle  while  the  node  still  has  some  work.  To  reduce 
the  possibility  of  this  happening,  we  could  keep  queue  e  as 
large  is  possible  Thus,  an  excellent  strategy,  which  appears  to 
be  optimum  is  to  give  priority  to  the  dedicated  queues,  serving 
queue  c  only  *Tte.n  one  of  Lhe  dedicated  queues  is  empty. 
Another  strategy  is  to  allow  packets  from  the  stream  V  to 
join  the  shorter  of  the  two  queues  (thus  eliminating  the  need 
for  the  third  queue)  We  have  (4)  evaluated  these  and  other 
strategies  for  tins  node  mode!  and  more  complex  node  models, 
and  have  found  the  performance  of  the  node  as  measured  by 
average  time  delay  to  be  fairly  insensitive  to  the  strategy 
adopted  bv  the  servers  Although  different  classes  of  traffic 
may  suffer  radically  different  time  delays,  the  average  delay  of 
all  iraffic  in  a  node  remains  roughly  the  same  for  many  good 
stiatepics  We  \m'I  consider  bel.iw  only  two  strategies  —  the  one 
wlu«.h  gives  pri"iih  to  the  dedusted  queues  and  the  "le  m 
aIiu'i  arov.ils  ■•".n  the  *.*>,  ;irst  uueue 

In  I  n-  i  u e  M  e  jn-.  jr,-  oueue  length  K(X)  versus 
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Fi*.  3.  A  node  with  two  servers. 


Fig.  4.  Estimated  average  number  of  packets  in  a  node  with  two  links. 
Simulation  results. 


terns.  We  assume  that  the  two  link  capacities  are  equal  to  C, 
K  =  *6.  Pa  ~  X«/pC.  pc  =  Xc/pC,  p  =  2p.  +  pc-  The  lower 
bound  is  for  an  M/M/2  queue  or  all  nondedicated  traffic  (Xa  = 
0).  The  upper  bound  is  for  all  dedicated  traffic  (Xc  =  0)  and 
represents  two  M/M/1  queues.  In  general,  all  of  our  results  lie 
between  those  two  bounds  of  performance.  The  circled  points 
are  simulation  results  for  different  traffic  mixes  and  the  strategy 
that  gives  priority  to  dedicated  queues.  From  this  figure  and 
from  similar  results  for  three-server  nodes  and  different  traf¬ 
fic  patterns  (as  discussed  below  and  in  Fig.  5),  the  following 
rule  of  thumb  has  emerged.  As  long  as  15-20  percent  of  the 
traffic  is  not  dedicated  and  the  choice  of  traffic  is  not  appor¬ 
tioned  to  disjoint  sets  of  servers,  then  the  node  behavior  will 
achieve  more  than  half  of  tltc  improvement  of  an  M/M/k 
queue.  It  has  been  shown  [7] ,  [8]  that  in  the  limit  of  heavy 
traffic  (liigh  utilization)  a  “join  the  shortest  queue"  policy  tends 
towards  the  performance  of  an  M/M/k  queue.  This  is  true  even 
if  some  of  the  arrivals  cannot  join  some  of  the  queues,  but 
there  must  be  some  linkage  between  all  the  queues.  This  is  just 
our  node  model.  Since  we  believe  that  our  priority  strategy  is 
better  than  a  join  the  shortest  queue  policy,  we  have  that  in 
the  limit  of  heavy  traffic,  M/M/k  behavior  is  achieved. 

The  above  remarks  hold  for  nodes  with  any  number  of 
links.  Assume  that  a  node  has  k  links.  Then  there  arc  2*  -  1 
classes  of  input  traffic  k  classes  of  dedicated  traffic,  one  to 
each  server;  k(k  —  l)/2  classes  of  traffic  assigned  to  specific 


M/M/ 1 
behavior 


Proportion  of  traffic  having  choice 


M/M/3 

behavior 


Fig.  5.  Performance  of  a  node  with  three  links.  Simulation  results. 


pairs  of  servers;  •••;  and  finally  one  class  of  traffic  that  can  use 
all  servers  (links).  Many  of  these  classes  may  be  empty.  But 
if  they  span  all  servers,  and  if  the  total  traffic  that  is  not 
dedicated  is  roughly  15-20  percent  of  the  total  traffic,  then 
nodal  delay  is  reduced  by  a  factor  of  order  k  over  a  single 
queue  model.  If  the  traffic  having  choice  is  too  light  or  if 
the  servers  are  in  isolated  groups,  then  this  advantage  is  re¬ 
duced. 

In  Fig.  5  the  normalized  time  delay  (in  units  of  service  time 
or  message  length)  is  given  as  a  function  of  the  amount  of  traf¬ 
fic  having  choice  at  a  three-server  node  and  for  various  utiliza¬ 
tions.  These  results  were  obtained  by  nodal  simulations.  The 
left  limit  of  the  graph  represents  M/Ml  l  or  nonadaptive  be¬ 
havior.  The  right  limit  represents  M/M/3  or  the  limiting  adapt¬ 
ive  behavior.  There  are  two  types  of  traffic  that  have  choice 
at  a  three-server  node.  One  type  may  be  served  by  any  of  the 
three  servers.  The  other  type  may  be  served  by  only  two 
servers-there  are  three  different  members  of  this  type.  A  va¬ 
riety  of  different  combinations  were  simulated.  The  resulting 
time  delay  showed  only  slight  variations  for  different  traffic 
patterns  and  depended  only  upon  the  total  traffic  havingchoice. 
It  is  seen  from  Fig.  5  that  if  15-20  percent  of  the  traffic  has 
choice,  then  the  behavior  is  significantly  closer  to  M/M/3  than 
to  M/Ml  1. 

For  high  utilizations,  the  time  delay  is  dominated  by  the 
waiting  time  and  the  maximum  improvement  expected  is  by  a 
factor  equal  to  the  degree  of  the  node.  For  moderate  utiliza 
tions,  M/M/k  operation  would  result  in  even  greater  relative 
savings 

We  have  been  able  to  develop  upper  bounds  on  node  per¬ 
formance  that  exliibit  some  of  the  above  behavior  For  a  more 
accurate  understanding,  we  have  resorted  ’.o  the  simulation  re¬ 
sults  already  described 

Consider  the  iwo-server  node  <>t  Fig.  .*  and  assume  dial  the 
dedicated  queues  have  pnorit>  Furtlu’imore,  lei  the  capacities 
C  r  I  for  convenience  Then  we  .an  derive  the  following  e\ 
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pressions: 

n£(K)  =  fii  +\,£(K)  (3) 

M£(b’&)  =Pl  +  (4) 

p£(K)<  3  Pb  +  5  KW’c)  +  nun  { X,(  /T(  IV, )  4-  £T(  K',. )] } . 

i-a,  b 

(S) 

In  these  equations  IV,-  is  the  waiting  time  of  queue  i,  £(W,)  is 
its  expectation,  p,  =  (X„  +  a Xc)/p,  p2  =  [Xa  +  (1  —  a)X(.]/p, 
and  a  is  the  proportion  of  Xf  traffic  served  by  server  1 .  Pg  is 
again  the  probability  that  the  system  is  busy.  Equation  (3)  is 
derived  as  follows.  An  arrival  to  queue  a  must  wait  for  server  1 
to  finish  service,  if  it  is  occupied,  and  then  wait  for  all  mes¬ 
sages  that  it  found  in  queue  a  to  be  served.  Recall  that  queue  a 
has  priority.  The  probability  of  server  1  being  busy  is  Pi ,  there 
are  XaEfW,,)  messages  already  on  queue  a  on  the  average,  and 
each  message  has  an  average  service  time  of  1/p. 

Equation  (5)  is  more  difficult  to  denve.  An  arrival  to  queue 
c  waits  for  the  next  completion  of  service  if  the  system  is 
busy.  The  probability  of  this  i sPB  and  the  average  time  to  the 
next  completion  is  l/2p.  It  must  also  wait  for  all  the  messages 
that  it  found  on  queue  c  when  it  arrived.  Again,  the  average 
number  of  these  is  X«.£(IVC)  and  each  goes  into  service,  when 
allowed,  with  a  rate  2^.  But  queue  c  does  not  have  priority. 
Thus,  it  must  also  wait  for  the  first  queue  to  put  into  service 
all  messages  found  in  that  queue  upon  arrival,  X^Zf/ U',)  on  the 
average,  and  for  all  other  messages  that  arrived  to  that  queue 
while  the  message  on  queue  c  was  waiting  — X,£(WC),  on  the 
average.  The  last  term  in  (5)  should  be  the  expectation  of  the 
minimum  of  two  random  variables-the  total  number  of  mes¬ 
sages  in  queues  a  or  b  while  our  message  waits  in  queue  c.  We 
upper  bound  this  with  the  minimum  of  the  two  expectations. 

If  we  let  X„  =  \b.  then  a  =  1/2.  Furthermore,  we  upper 
bound  fg  bvp,  --  p2 .  Equations  ( 3)—(5 )  can  now  be  solved  to 
obtain  for  the  average  waiting  time  for  all  messages: 

~  (1  -$/*)  (6) 
1  ~P 

where  0  =  Xc/(Xa  +  \b  +  Xf),  the  fraction  of  traffic  having 
choice.  Note  that  for  M/M/\  behavior,  p£(W)  =  p/(  1  -  p), 
and  for  M/M/2  behavior,  p£(k')  =  (p/(l  -  p))(p/(I  +  p)).As0-*- 
0,  our  upper  bound  converges  to  M/M/ 1  behavior.  As  0  -*  l, 
our  unper  bound  yields  a  reduction  of  1/2,  where  M/M/2  be¬ 
havior  has  a  better  improvement  of  p/(  1  +  pi  Our  upper  bound 
on  I’p  is  causing  the  discrepancy  here.  In  any  event,  we  note 
the  linear  behavior  with  0  in  (6). 

A  similar  expression,  with  similar  results,  can  be  derived  for 
a  thicc-server  node.  Let  the  arrival  rate  to  the  dedicated 
queues  be  equal,  and  assume  only  one  oilier  class  of  traffic 
which  can  choose  hetween  all  three  servers  Then  we  obtain 

q£  (  It ,  =  ( 1  -  j  0)  t  ‘ » 

I  -p 

where  >1  has  the  same  mcanine  as  above  Here,  is  J  *  I  p/  hi  I 


is  a  third  of  that  obtained  with  M/M!  I  queues  whereas  an 
M/M, '3  queue  would  have  an  even  smallei  waiting  tunc 

VI  NETWORK  1*1  KK)K VANCE 

In  general,  we  can  say  that  ii  we  adopt  this  adaptive  routing 
procedure,  and  il  at  each  node  there  is  a  sutti.ienl  amount  ol 
nondedicated  traffic,  then  at  high  utilisation  the  tune  delay  at 
each  node  is  reduced  by  almost  the  degree  of  that  node  from 
the  time  delay  using  nonadaptive  routing  The  effect  on  the 
network  will  be  to  reduce  the  time  delay  by  a  factor  on  the 
order  of  the  average  degree  of  the  nodes  For  moderate  utiliza 
tions,  a  large  part  of  that  advantage  will  still  be  achieved  As¬ 
sume.  for  simplicity,  that  using  nonadaptive  routing,  all  links 
have  the  same  average  tune  delay  7~, .  Then  T  —  IT , .  If  our 
adaptive  strategy  is  used,  then  Tin ,  =  \/k(n)T]  where  kin)  is 
the  degree  of  node  n.  For  this  adaptive  routing,  the  network 
delay  becomes  at  best  T  ^  IT t  *  2in(X,n)/A:(n))/X  If  all  = 
k.  then  the  reduction  is  of  the  order  of  A  . 

The  accuracy  of  the  approximate  performance  at  a  node 
improves  as  more  of  the  traffic  has  choice  and  as  the  utiliza¬ 
tion  increases.  These  results  serve  as  a  quick  estimate  of  the 
benefits  obtainable  from  adaptive  routing  and  are  useful  as  a 
design  tool. 

VIII.  EXAMPLES 

We  have  evaluated  many  examples  of  networks  [4] .  Most 
of  these  had  some  symmetry  and  a  modest  number  of  nodes 
(at  most  20).  Our  evaluation  procedure  was  as  follows.  We  first 
obtained  the  optimum  nonadaptive  routing  and  evaluated  its 
performance.  In  most  of  the  examples,  this  resulted  in  using 
shortest  paths.  We  then  allowed  the  adaptive  routing  to  use 
all  shortest  paths  for  each  commodity.  We  then  found  the 
amount  of  traffic  that  had  choice  at  each  node.  For  a  wide 
variety  of  situations,  there  was  sufficient  nondedicated  traf¬ 
fic  at  each  of  the  nodes  to  justify  our  approximation. 

There  are  two  steps  in  a  careful  evaluation-the  computa¬ 
tion  of  the  amount  of  traffic  of  each  class  (with  respect  to  the 
nature  of  choice  )  in  each  node  and  the  evaluation  of  node  per¬ 
formance.  We  have  already  discussed  good  estimates  for  the 
latter.  The  former  step  can  be  quite  complex,  since  it  depends 
upon  how  traffic  is  split  at  each  node  at  which  it  has  choice. 
We  have  developed  several  approximate  techniques  to  perform 
this  step  (4) .  For  example,  consider  Fig.  3  again.  Let  a  be  the 
fraction  of  Xr  traffic  that  is  served  by  a  lhe  total  utilization 
of  server  a  is  (X0  +  ok(  I/p  -  p  t .  Similarly,  define  p;  =  ( Xfc  + 
(1  a)\r]/p.  Queue  a.  including  the  server,  is  empty  with 

piobabilitv  I  P|  A  message  on  queue  c  will  be  served 
by  a  only  if  the  entire  queue  is  emptv  In  us .  appro  .mute  q  'ov 
or  —  ( 1  -  P| )/( 2  -  P|  p ^ ).  This  can  he  solved  for  a  to  obtain 
a  -  ( 1  -  p„)/<2  -  p„  ph)  where  p„  >a  p.  pb  =  Xh/p  This 
approximation  has  been  verified  by  simulation  [4| 

We  suminaii/e  one  example  ot  a  six  nude  network  shown  in 
i  ,g  n  1  ,ul.  ot  the  no,..-,  h  r.  decree  3  We  allow  only  shortest 
pull  lies.-;  oik  oi ; -  ...  .on  }>•.-.  ,  lli.r  all  Iralt'k 

lli'l,-  I  l.ilul  -iei  I-,  o  .....  '  , .  im:  I  r.a!  h.  tr oil)  !  to  3 

or  .  ius  a  clue  e  >'  three  <v  11  y  p.i'i  -.  We  assume  that  all 
links  have  tin"  -a:  .  ,  II-'..  .'<  |'|  and  all  y,,  r  It  is 

I  all  iv  e.i-y  In  ••  ini!  -  "  ■  '  .  and  .'  *  >  \  in 
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adaptive  optimum  routing  can  be  found  to  that  the  flows  in 
each  link  are  identical,  X(  *  7/3 r.  Thus,  for  nonadaptive 
routing,  T ■*  1 .4/(10  -  7/3 r).  Asr  —  30/7  «=  4.29,  the  adaptive 
routing  will  give  an  advantage  of  a  factor  of  3 . 

We  now  compare  the  optimum  nonadaptive  routing  with 
our  adaptive  routing.  For  nonadaptive  routing,  we  used  the 
above  results  for  single  server  queues.  For  the  adaptive  routing, 
we  used  the  results  of  nodal  simulations  for  three-server  nodes 
and  T  m  IT(m)  u  discussed  in  Section  VI.  For  r  =  1.84,  p  = 
0.43,  the  nonadaptive  routing  had  a  time  delay  of  0.24S, 
while  the  adaptive  routing  had  a  time  delay  of  0.197.  For  r  *» 
3.06,  p  “  0.71 ,  the  results  were  0.490  for  nonadaptive  routing 
and  0.29S  for  adaptive  routing.  Thus,  we  have  reduced  the 
time  delay  by  a  factor  of  1.66  in  the  latter  case.  In  this  ex¬ 
ample,  the  amount  of  traffic  having  chc  ce  at  each  node  was 
2/7  of  the  total  or  29  percent.  If  full  multiple  server  perform¬ 
ance  was  attainable,  the  time  delay  would  have  been  0.217. 
Thus,  70  percent  of  this  Improvement  was  attained.  The 
improvement  Increases  if  more  traffic  has  choice.  As  the  utili¬ 
zation  increases,  the  full  benefit  of  multiple  server  perform¬ 
ance  is  achieved.  The  comparison  can  also  be  made  with  respect 
to  waiting  timea  since  the  service  times  are  constant  and  can¬ 
not  be  reduced.  Here  the  service  time  for  the  network  is  0.14. 
Thus,  for  43  percent  utilization,  the  waiting  time  has  been 
reduced  from  0.10S  to  0.0S7  by  use  of  adaptive  routing— a 
factor  of  1214.  At  71  percent  utilization,  the  reduction  is  from 
0.350  to  0.155— a  factor  of  2.26. 

A  number  of  other  examples  with  as  many  as  20  nodes  have 
been  investigated  [4] .  Most  had  some  amount  of  symmetry, 
but  not  nearly  as  much  as  the  example  cited  here.  One  12- 
node  example  was  generated  arbitrarily.  In  all  cases,  the  above 
procedure  was  followed.  The  optimum  nonadaptive  routing 
was  found.  In  almost  all  of  the  examples,  shortest  paths  were 
used.  The  adaptive  routing  was  created  by  giving  a  choice 
among  all  shortest  paths  for  each  commodity.  There  was  suf¬ 
ficient  choice  at  most  nodes. 

The  effects  of  asymmetry-of  the  topology,  the  traffic 
matrix,  and  link  capacities-have  not  yet  been  adequately 
studied.  For  very  small  networks  or  pathological  cases,  longer 
paths  may  have  to  be  used  by  the  adaptive  strategy  to  achieve 
sufficient  choice  at  some  nodes.  This  will  increase  the  utiliza¬ 
tion  and  reduce,  and  perhaps  eliminate,  the  performance 
advantage.  However,  the  performance  advantage  seems  signifi¬ 
cant  enough  to  leave  some  margin  for  this  tradeoff. 

Asymmetries  in  link  capacities  and  traffic  patterns  create 
another  problem.  The  second  level  of  adaptive  routing  is  based 
on  local  information  only.  This  may  lead  to  saturation  at  other 
nodes.  The  first  level  of  the  adaptive  routing,  as  described 
above,  may  not  have  enough  flexibility  to  circumvent  this.  Wc 


have  begun  to  study  a  variation  that  si.ows  some  promise.  Con¬ 
sider  traffic  beiween  a  pan  of  nodes  that  lias  two  or  more 
good  paths.  Previously  we  iv.urn.d  nut  all  iliat  traffic  was  to 
be  given  choice  Here  we  dedicate  some  of  that  traffic  to  speci 
fic  paths  in  order  to  balance  the  load.  For  example,  we  may 
wish  to  produce  a  flow  distribution  similar  to  that  used  in  die 
nonadaptive  routing  scheme.  The  remainder  of  the  traffic  is 
allowed  to  have  choice  as  before.  Since  the  amount  of  traffic 
having  choice  at  a  node  is  not  critical,  it  is  expected  that  it  will 
still  be  possible  to  achieve  a  performance  advantage.  Several 
simple  examples  that  have  been  tried  have  been  encouraging. 

A  brief  report  on  two  further  examples  illustrates  these 
ideas.  Details  can  be  found  in  [4] .  Fig.  7  shows  an  asymmetn 
cal  eight-node  network.  We  assume  equal  channel  capacities 
and  equal  requirements  between  pairs  of  nodes.  Four  of  the 
nodes  have  two  servers;  the  other  four  have  three  servers. 
Hence,  we  expect  the  maximum  improvement  to  be  by  a  factor 
between  2  and  3.  The  optimum  nonadaptive  routing  uses 
shortest  paths.  The  adaptive  routing  allows  choice  among  these 
shortest  paths,  thus  retaining  the  same  utilization.  The  most 
utilized  links  were  the  two  connecting  clusters  of  four  nodes. 
When  these  were  80  percent  utilized,  the  adaptive  routing 
scheme  outperformed  the  optimum  nonadaptive  scheme  by  a 
factor  of  1 .6  for  time  delay  and  2.6  for  waiting  time.  When  the 
utilization  was  90  percent,  these  factors  were  2.2  and  almost 
3,  respectively. 

Consider  another  asymmetrical  eight-node  network,  shown 
in  Fig.  8.  Here  we  assumed  an  asymmetrical  traffic  matrix.  If 
only  shortest  paths  were  allowed,  little  traffic  has  choice,  a 
nonadaptive  and  our  adaptive  scheme  are  almost  identical, and 
the  time  delay  is  6.24  for  both.  The  optimum  nonadaptive 
routing  had  a  time  delay  of  4210.  An  adaptive  routing  can  be 
found  using  the  same  length  paths  for  each  node-pair  traffic. 
This  resulted  in  a  time  delay  of  4D8— a  15  percent  improve¬ 
ment.  There  was  not  enough  choice  to  get  more  improved 
performance  in  the  nodes.  When  more  choice  was  allowed, 
increasing  the  utilization,  the  tune  delay  increased  to  4.62. 

-The  service  time  here  was  equal  to  2.  Hence,  the  waiting  uir.e 
for  shortest  paths  was  4.24,  the  optimum  nonadaptive  routing 
had  a  waiting  time  of  2210,  and  the  adaptive  routing  with 
equal  length  paths  had  a  waiting  time  of  2.08-a  25  percent 
improvement. 

More  study  is  needed  to  fully  understand  the  influence  of 
these  asymmetries  on  this  type  of  adaptive  routing. 

VIII.  CONCLUSIONS  AND  FURTHER  WORK 

We  have  presented  a  two-level  adaptive  routing  procedure 
that  exhibits  marked  impiovement  in  performance  over  non¬ 
adaptive  routing.  Moreover,  we  have  presented  some  analytic 
approximations  to  estimate  this  improvement.  In  heavy 
traffic,  we  have  shown  the  improvement  to  be  by  a  factor 
equal  to  the  nodal  degree  (or  average  nodal  degree),  (n  general, 
for  moderate  utilization,  a  large  part  of  this  improve  lent  is 
still  achieved. 

A  more  elaborate-  simulation  of  a  network  is  underway.  In 
that  work,  the  independence  assumption  is  not  made  Al 
though  it  confirms  our  model,  it  has  shed  some  lighi  on  ihe 
"independence"  assumption  We  found  ilia!  at  Ingli  utili/a 
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An  eiffht-node  asymmetrical  network. 


Fig  8  An 


•t 

f\  Nov 


eight-node  asymmetrical  network  used  with  an  asymmetrical 
traffic  matrix. 


tion  we  were  getting  most,  but  not  all,  of  the  expected  im¬ 
provement  Many  different  nodal  strategies  had  only  a  modest 
effect  on  resolving  this  gap  We  found  that  it  was  easy  to 
generate  strategies  that  produced  multiple  server  behavior.  But 
some  of  them  had  a  profound  effect  on  the  output  process 
from  the  server,  and  thus  affected  the  Poisson  assumption  for 
inputs  to  the  next  node.  The  priority  scheme  was  notorious  in 
this  regard  Usually  any  traffic  one  hop  away  from  its  destina¬ 
tion  will  be  placed  in  a  dedicated  queue.  Thus,  only  continuing 
traffic  will  be  found  in  nondedicated  queues.  These  have 
lower  priority  and  will  be  transmitted  only  when  the  dedicated 
queue  is  empty.  But  then  a  burst  of  several  packets  is  likely  to 
he  sent  A  bursty  arrival  process  results  in  poorer  queue  per¬ 
formance  than  a  Poisson  process.  This  contributed  signifi¬ 
cantly  to  the  delay  performance  gap  The  design  of  a  nodal 
strategy  must  consider  not  only  multiserver  performance 
achievement  at  a  node,  but  the  resultant  output  process  as 
well.  This  is  the  direction  of  our  current  research 

The  usual  way  such  a  burst  could  occur  is  when  the  packets 
are  short.  But  these  should  have  made  the  next  node’s  job 
easier  Unfortunately,  the  second  aspect  of  the  independence 
assumption  now  comes  into  play-packet  lengths  are  reassigned 
a>  each  node.  This  accentuates  the  effect  of  burstiness.  When 
the  simulation  was  altered  to  take  into  account  dependence 
between  nodes,  hence  dispensing  with  the  independence  as¬ 
sumption,  the  difficulty  disappeared. 

ACKNOWLEDGMENT 

The  simulation  work  described  in  the  last  section  has  been 
performed  by  P.  Chu. 

REFERENCES 

|l|  M  Oerh  Deterministic  and  adaptive  routine  policies  in  packet 
synched  computer  networks.  "  in  Pros'  Ird  Data  ( Hmmun  S\»ip 
St  Petersburg  H..  Nov  197V  pp  2"*  ?g 

121  K  G  (ijllaccr  A  minimum  doluv  routine  algorithm  tome 
distributed  umipuljlion  Ih.ht.  1  runs  (  ommun  \-»l  C»iM 
pp  "*  *  X*  Jan  197? 

|*|  D  f»  t  .ihittf  ,ti>,|  M  Gi*flj  '  Optim.il  rou'im’  m  ,i  p.t,  i .  i  mmk  h,  ,| 
iompiii-.T  ncivfc.uk  fff-f-  I  runs  (fmpt.l  vnl  (  '  1  pp  Hii-’ 

l<»bH  CM  1974 


A  l  iv  no  and  R  R  Roorstvn  On  a  technique  I  *  *r  dsnami. 
routine  in  Pros  Sat  1  fUu  (  •>nt  Dallas  f\  Nov 

19^0  n  i?  ’  |  M\:i  |  ivnc  Dvitjitm  routing  >n  v  mtipuict 

voinrnunu  ation  iifiunrl.  I’h  |>  d<*  crt.in.»n  I*  -Ivirt  hr-.  Ihm 
New  >  oft.  lifi  m  >k  I  v  n  N  >  Jt)|\  lu'r-  -\  ,  .  t- 

Hoorstvn  A  model  tor  cititient  romme  tn  S  I  vomputri  .  mi< 
mnniuMons  networks  tn  Pr,„  /  /  R(i  if  IP  ’9  P  a  s.(ritri 
l.d  Amsterdam  North  Holland.  1979  pp  \4  V  *M) 

H  Rudm  “On  routine  and  delta  routine  A  taionomy  and 
performance  comparison  ot  techniques  for  packet  switched  ne« 
^I’rks  ft-t  A  Trans  (  itmmun  vol  COM-24,  pp  4^-S9  Jan 
197b 

I  K le  inrock .  Queuing  S \ stem s  New  York  Wdev.  1975 
Ci  J  Foschim  and  J  Sal/.  “A  hasu  dynamic  rout ■  np  problem  and 
diffusion  ftt.t.  7 runt  Comma n  vnl  COM  2b  pp  ^2B  v?" 
Mar  1 97K 

Ci  J  Foschim .  "On  heavy  traffic  diffusion  analysis  and  dynamic 
routine  in  packet  switched  networks  "  Compuirr  Performam  e . 
Chandv  and  Reiser.  Ed  Amsterdam  North  Holland  1977  pp 
499-5  I  5 


Robert  R.  Boorslvn  (M  SKi  was  born  in  New 
York  NY.  on  Mas  28.  19^7  He  received  the 
w  *  B  t  E  decree  from  the  City  College  of  New 

*  York.  New  York.  N5  in  1958  and  the  M  S  and 
*•  y  Ph  I)  degrees.  both  in  electrical  engineering 

£  —  from  the  Polytechnic  Institute  of  Brooklyn. 

^  '  Brooklyn  NY.  in  I9M  and  I9bb  respectively 

^  ^  ;  From  1958  to  |9b|  he  worked  as  an  Engineer 

for  the  Advanced  Studie*  Department  of  the 
s'Jl  k  «•  •  i  Sperry  Gyroscope  Company  In  19b!  he  joined 

the  staff  of  the  Department  of  Electrical  Fnei 
neerine  Polytechnic  Institute  ot  Brooklyn,  now  the  Polytechnic  Institute 
of  New  York .  w  here  he  is  currently  a  Professor  From  1977  to  1978  he  w  as 
on  leave  at  Beff  Laboratories  where  he  conducted  research  in  computer 
communications  networks  His  current  research  mteicsts  are  in  computer 
communication  networks,  data  communications,  and  communication 
theory 

Dr  Boorstvn  has  been  Secretary  of  the  IEEE  Information  Theory  Group 
and  Editor  for  Computer  Communication  of  the  IEEE  TRANSACTIONS  ON 
COM  Ml  'NIC  A  TION  S  He  is  Chairman  of  the  Computer  Communication 
Committee  of  the  IEEE  Communications  Society  and  Associate  Editor  of 
the  journal  Networks  He  was  a  member  of  the  delegation  to  the  first  joint 
I’SSR-IEF.E  Workshop  on  Information  Thtrrv  Moscow .  December 
1975 


Adam  l.ivne  tM  72)  received  the  B  Sc  and 
M  Sc  degrees  in  electrical  engineering  from  the 
f  9  Tcchnion  —  Isrjt  I  Institute  of  Technology .  Haifa. 

<  \  in  *9*9  and  I9b^  rcspectivelv  and  the  Ph  D 

degree  in  electrical  eng  inciting  from  the  Polv 
iechniv  Institute  of  New  York.  Brooklyn.  N"V 
* —  From  l4,sv*  to  |9bS  he  wav  an  I  Ir.  M.u.i. 

Design  (>tt.v  -r  in  the  Isrjcli  Saw.  whiic  he 
«.  ondm  led  x ommuni.  jlions  and  radar  systems 
•;  designs  In  |9ps  he  joined  the  Scicntili-.  Dc 
*  r  panntent  of  the  Ministry  of  Defence  as  a  Re 

search  Enj.ncer  in  the  t  omioumcjtion  Department  He  look  pail  and 
directed  advanced  RAD  projects  in  signal  proecssme  and  commun»xahoh 
sv  stems  Between  |9b9  and  he  worked  at  the  l  ockhced  Fleitronns 

Compin-  Plair.ticld  NJ  ,iv  j  Pnneip.il  Fngincer  in  the  design  an  1 

Mi;ii»l.ilion  of  dieital  sicn-ii  r-ioxcssnic  systems  During  i^p  hr 

.  uvple -led  ho  studies  lov  .ml  the  I’h  I)  Oc . tee  ji  the  I’olv tc.  him  Invtmite 
.it  N  «  5  ork  \»  prC'C'ii  tn  I-  v  ni.ir  ki  scjfvt’  I  ncmcer  at  *hc 

f  h  x  ti-u-u  Division  S» 'cniifi.  I  >rj»  i " mem  Min.sirv  ot  1‘eten.  r  lr! 

\ v  i\  Kr  j.-‘ 


! 


B.2  A  Simulation  Study  of  a  Dynamic  Routing  Scheme 
Chu,  Boorstyn,  and  Kershenbaum 

IEEE  National  Telecommunications  Conference,  November 


1981 ,  New  Orleans 


Reprinted  from  NATIONAL  TELECOMMUNICATIONS  CONFERENCE,  November  1981 


A  SIMULATION  STUDY  OF  A  DYNAMIC  ROUTING  SCHEME 


P.  H.  N.  Chu,#  R.  R.  Boorstyn,  A.  Kershenbaum^ 

Bell  Telephone  Laboratories  Polytechnic  Institute  of  New  York 

Holmdel,  New  Jersey  07733  Brooklyn  New  York  11201 


ABSTRACT 

A  dynamic  routing  scheme  Is  studied. 
Instead  of  assigning  fixed  paths  or 
randomly  selecting  paths,  permissible 
paths  (e.g.,  minimum  hop  paths)  are 
pre-asslgned  to  each  node  pair.  A 
packet  arriving  at  a  node,  and  destined 
for  another  node.  Is  assigned  to  one 
of  two  types  of  queues  -  a  dedicated 
one  or  a  shared  one.  The  objective  of 
the  routing  scheme  Is  to  reduce  both 
nodal  as  well  as  network  average  packet 
delay.  We  have  conducted  a  series  of 
simulation  studies  to  evaluate  the 
nodal  as  well  as  the  network  delay 
under  the  proposed  dynanlc  routing 
scheme.  Several  Important  and  encouraging 
results  are  obtained  and  are  presented 
In  this  paper. 


1.  INTRODUCTION 

A  dynamic  routing  scheme  has  been 
previously  proposed  (References  [1],  [2], 
[33,  and  [4  ]).  Instead  of  assigning 
fixed  paths  or  randomly  selecting  paths, 
permissible  paths  (e.g.,  minimum  hop 
paths)  are  pre-asslgned  to  each  node  pair. 
A  packet  arriving  at  a  node,  and  destined 
for  another  node,  is  assigned  to  one  of 
two  types  of  queues.  A  dedicated  queue 
13  a  queue  whose  packets  are  served  by 
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one  and  only  one  dedicated  server,  A 
shared  queue  Is  a  queue  whose  packets  r.a > 
be  served  by  two  or  more  specified 
servers.  A  server  Is  equivalent  to  an 
output  link.  The  purpose  of  pre¬ 
selecting  paths  Is  t  -  iv  :  i  . -r.g  ;  -. 1 
and  resultant  increased  r.etworn.  '.raff: 
that  would  occur  In  a  random  routing 
scheme.  The  purpose  of  the  share!  ;ueue 
Is  to  let  a  portl'r.  of  the  nrdai  •  raffle 
experience  the  delay  of  a  multiple 
server  queue  and  therefore  reduce  the 
average  packet  delay  at  a  node.  The 
objective  of  the  model  Is  therefore  to 
reduce  both  nodal  as  well  as  network 
average  packet  delay. 

We  have  conducted  a  series  of  simulation 
studies  to  evaluate  the  nodal  as  well 
the  network  delay  under  the  proposed 
dynamic  routing  scheme.  A  more  detalln 
report  can  be  found  lr.  Reference  [?>]. 
Section  2  describes  the  nodal  ar.d  net¬ 
work  models  as  well  as  the  different 
service  disciplines  used.  Section  3 
gives  the  results  of  nodal  and  network 
delay  performance.  A  summary  •  ?  giver, 
in  Section  A . 

2.  THE  MODEL 

2 . 1  The  Model  Model 

The  traffic  from  each  pair  Is  preassigned 
to  one  or  more  pre-selected  paths  (and 
therefore  server(s)  at  a  r.cde).  Twr 
types  of  queue  car.  be  contructed  at  a 
node.  A  dedicated  queue  can  only  be 
served  by  one  dedicated  server.  There 
are  at  most  n  such  queues  at  a  node  with 
n  links.  A  shared  queue  can  be  served 
by  any  of  several  preansiwr.-  ;  .  rv“r 

There  could  be  as  many  a:  :l'.  :  -  1 

shared  queues  at  a  node.  A  tw_  server, 
three  queue  nodal  model  is  shewn  in 
Figure  1. 
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FIGURE  1  A  NODAL  QUEUEING  MODEL  WITH  TWO 
SERVERS 


In  Figure  1,  traffic  arrives  at  the 
different  queues  with  arrival  rates 

(i  ■  1,2,3)  packets/sec.  Queue  1  and 

queue  3  are  competing  with  server  1  for 
service  while  queue  2  and  queue  3  are 
competing  for  service  by  server  2.  The 
service  rate  is  y  packets/sec.  If  one 
of  the  competing  queues  (e.g.,  queue  1 
or  queue  3)  is  empty  (e.g.,  queue  1), 
and  the  server  for  these  queues  is  idle, 
a  packet  in  the  non-empty  queue  is 
served.  If  both  queues  are  not  empty 
when  the  server  has  Just  completed 
service,  a  pre-deflned  service 
discipline  will  determine  which  queue 
to  serve  next.  When  both  competing 
queues  are  empty  (e.g.,  queue  1  and 
queue  3)  and  the  server  for  these  queues 
is  idle,  the  server  will  stay  idle  until 
there  is  a  new  arrival  to  one  or  both  of 
these  queues.  The  idle  server  will  not 
serve  the  queue  which  is  not  pre¬ 
assigned  to  it  for  service.  This  is  the 
major  difference  between  our  nodal  model 
and  a  multiple  server  queueing  system. 
The  operation  of  a  node  with  more  commu- 
catlons  links  is  basically  the  same  as 
that  of  a  two  link  node.  Shared  queues 
will  be  served  by  a  subset  of  the 
servers. 


2 . 2  Service  Disciplines 

The  nodal  model  described  in  the  previous 
section  allows  different  types  of  queues 
to  compete  for  service.  A  service 
discipline  must  therefore  be  specified. 

We  will  describe  four  different  service 
disciplines  for  the  nodal  model  of 
Figure  1. 

1.  Priority  Service  Discipline  (PSD) 

A  server  always  searches  the 
dedicated  queue  for  service  prior 
to  its  service  of  the  shared  queue. 


For  example,  server  1  in  FS.-jr-.  '.  . 
after  ;>!«■».  lug  a  service,  w !  !  .  al¬ 
ways  1  r.  \  u  queue  1  fur  mor-- 
service.  !  •  unis  In  queue  •  w  i ;  ye 
served  by  serve  1  only  If  queue  .  1:; 
empty  when  server  1  nas  Just 
completed  a  service.  Therefore, 
server  1  is  busy  as  long  as  there 
are  packets  to  be  served  in  queue 
1  or  queue  3-  Conversely,  server 

1  is  idle  and  stays  idle  when  both 
queue  1  and  queue  3  are  empty.  The 
instantaneous  queue  length  of  queue 

2  does  not  effect  any  of  the 
decision  rules  at  server  1 
described  above.  A  packet  entering 
the  shared  queue  when  both  servers 
are  Idle  is  served  randomly  by 
either  server  1  or  server  2. 

2.  Alternate  Service  Discipline  (ASD) 

A  server  at  the  node  in  Figure  1, 
whenever  possible,  alternatively 
serves  n  packets  from  the  dedicated 
queue  and  m  consecutive  ones  from 
the  shared  queue.  The  server  is 
forced  to  serve  more  than  n  (or  m) 
packets  from  the  dedicated  (or  the 
shared)  queue  if  the  other  queue 
happens  to  be  empty.  Under  this 
situation,  alternation  of  service 
will  begin  whenever  there  is  a  new 
arrival  to  the  empty  queue.  The 
process  is  started  over  again  at 
this  point. 

3.  Serve  The  Longer  Queue  Discipline 
(SLQD) 

The  queue  with  the  longer  queue 
among  the  two  competing  queues 
(the  shared  and  the  dedicated  queues 
gets  service  first.  A  server, 
upon  a  service  completion  compares 
the  queue  lengths  of  the  two 
competing  queues,  and  serves  a 
packet  from  the  longer  queue.  The 
SLQD  tries  to  equalize  all  the 
queue  lengths  at  a  node. 

4.  Random  Service  Discipline  (RSD) 

A  server  (e.g.,  server  1),  upon 
completing  a  service,  serves 
packets  from  one  of  two  competing 
queues  according  to  the  following 
algorithm:  It  serves  queue  1  if 

queue  3  is  empty  and  vise  versa; 

It  randomly  selects  a  queue  if 
both  queue  1  and  queue  3  are  non¬ 
empty.  This  is  a  totally 
"uncontrolled"  service  discipline. 

2.3  The  Network  Model 

In  order  to  describe  the  dynamics  of  a 

packet  switching  employing  the 

proposed  dynamic  routing  scheme,  one 
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must  (1)  construct  nodal  queueing 
models  similar  to  the  one  In  Figure  1 
for  every  node  In  the  network,  (2)  pre¬ 
select  a  set  of  "permissible"  paths 
for  each  source-destination  node  pair, 
and  (3)  assign  the*data  flow  according 
to  the  preselected  paths  between 
adjacent  nodes.  In  order  to  simplify 
description  of  the  network  model,  we 
have  chosen  a  four  network  to  Illustrate 
the  dynamics  of  our  network  model. 

A  four  node  network  is  shown  In  Figure 
2.  Every  node  is  connected  to  two  links, 
i.e.,  is  of  degree  two.  Therefore,  the 
nodal  queueing  model  at  each  node  is 
exactly  the  same  as  that  of  the  two 
server  three  queue  model  described 
previously.  The  permissible  paths 
between  sources  and  destinations  are 
selected  based  on  a  shortest  path 
(minimum  hop)  algorithm.  Adjacent  nodes 
will  only  use  the  one  hop  path  between 
them.  Non-adjacent  nodes  will  use  either 
one  of  the  two-hop  paths.  Therefore, 
traffic  between  adjacent  nodes  is  pre- 
asslgned  to  the  dedicated  queues. 

Traffic  between  non-adjacent  nodes,  on  the 
other  hand,  is  pre-asslgned  to  the  shared 
queue  at  the  source  node  and  the 
appropriate  dedicated  queue  at  the 
intermediate  node. 


Each  server  at  the  node  serves  the  two 
competing  queues  (the  dedicated  queue 
and  the  shared  queue)  according  to  a 
pre-specified  discipline.  We  have 
simulated  the  network  for  the  four  service 
disciplines  discussed  previously.  A 
service  discipline  where  the  condition  of 
the  next  node  Is  "known"  to  all  servers 
at  the  current  node  and  used  to 
determine  routing  is  also  studied. 

3.  RESULTS 

Simulation  programs  in  PL/1  have  been 
written  to  evaluate  the  delay  of  the 
nodal  model  of  Figure  1  and  the  network 
model  of  Figure  2  operating  under  the 


different  service  disciplines.  All  simu¬ 
lation  results  are  based  on  statistics 
collected  over  a  sufficient  time  Interval 
so  as  to  represent  the  steady  state. 

The  statistics  collected  for  the  PSD 
scheme  are  based  on  at  least  one  million 
(sometimes  four  million)  simulation 
events  -  Including  both  arrivals  and 
departures.  In  order  to  conserve 
computing  time,  the  statistics  collected 
for  other  service  disciplines  are  based 
on  at  least  two  hundred  thousand 
simulation  events.  In  our  simulation 
program,  all  arrival  processes  to 
source  nodes  are  assumed  to  be  Poisson 
and  all  service  distributions  are 
assumed  to  be  exponential  and  Independ¬ 
ent  of  each  other.  Ho  assumption  Is 
made  about  arrivals  processes  to  inter¬ 
mediate  nodes  in  the  network  model. 

3.1  Model  Delay  Performance 

The  percentage  of  the  traffic  that  Is 
shared  at  a  node  is  an  Important 
parameter  affecting  the  nodal  delay. 

We  have  studied  nodal  delay  based  on 
three  sets  of  nodal  traffic  mixes. 

They  are  low  (approximately  10  percent 
to  15  percent),  moderate  (25  percent), 
and  high  (50  percent)  percentages  for 
the  shared  traffic  at  a  node  respectively. 
Fairly  extensive  simulation  runs  were 
conducted  throughout  the  whole  range 
of  nodal  utilization  (0  to  1)  for  the 
PSD  and  a  moderate  percentage  of 
shared  traffic.  We  assumed  that  the  input 
processes  to  the  node  were  Poisson. 

3.1.1  Simulation  Results  Of  The 
Priority  Service  Discipline 

The  effects  of  the  percentage  of  the 
traffic  that  is  shared  on  nodal  delay 
are  tabulated  in  Table  1.  Notice  that 
the  higher  the  percentage,  the  closer 
the  nodal  delay  is  to  that  of  an 
M/M/2  queueing  system. 

Figure  3  plots  the  delay-throughput 
relationship  of  the  PSD  for  different 
percentages  of  shared  traffic.  The 
delay  for  the  M/M/2  and  the  M/M/1 
queueing  systems  are  also  plotted  for 
reference. 

In  Figure  4,  a  measure  of  the  "close¬ 
ness"  of  the  PSD  to  M/M/2  queueing  system 

.  ^M/M/l-— PSD  .  ■]  ,_*_  j 

performance,  -  -  -  -  _  ,  is  plotted 

%/M/l"%/M/2 

against  the  percentage  of  shared  traffic 
for  a  node  operated  under  the  PSD 
scheme.  The  utilizations  are  studied. 

The  ordinate  Is  the  percentage  of 
multiple  server  performance  that  is 
obtained  by  the  PSD.  At  high  traffic 
utilltlzation  (p-0.875),  only  25  percent 
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FIGURE  5  OELAY-THROU  GHPUT  RELATIONSHIP  FOR 
NODAL  MODEL 


3.1.2  Results  For  Other  Service 
disciplines 


In  this  section,  we  discuss  the  nodal 
delay  for  other  service  disciplines.  For 
these  disciplines,  we  only  studied  the 
nodal  performance  for  p  =  0.875  and  50 
percent  of  the  traffic  being  shared. 

Table  2  tabulated  the  nodal  delay  from  our 
nodal  simulation  for  the  ASD,  the  SLQD ,  and 
the  FSD.  We  can  make  the  following 
observations  from  Table  2: 

1.  A  general  characteristic  of  the 
results  in  Table  2  is  that;  at 
high  utilizations  (p=0.875  in  this 
case)  with  a  high  percentage  of 
the  traffic  shared  ( 5 0  percent), 
the  nodal  delay  is  closer  to  that 
of  the  M/M/2  than  the  M/M/1 
queueing  system  regardless  of  the 
service  disciplines. 

2.  The  PSD  has  the  best  nodal  delay 
among  all  service  disciplines. 


5.  The  .dal  delay  •>!'  the  ,  wl’:, 

the  mod  1  flc.lt  1  hi  «.f  not  st-rvlr;.' 
v whenever  possible  )  two  pack..- to  1:. 
a  row  from  the  shared  queue,  1:.  *  t- 
closest  to  that  of  the  i’f-D. 

4 .  The  ASD  with  parameter  x  =  l  ar.d  •  he 
threshold  at  the  shared  queue  set 
to  zero,  has  almost  the  worst  nodal 
delay  performance. 

5.  The  RSD,  being  that  it  is  an 
"uncontrolled"  service  discipline, 
does  perform  the  worst  among,  all 
service  disciplines.  However,  the 
PRSD  performs  better  than  the 
M/M/1  queueing  system  by  almost 

40  percent. 

Table  2  Nodal  Simulation  Results 
(A  «  2*1.75  packets/sec, 

Aj=3.5  packets/sec,  u=4  packets/sec. 

P'0.875) 


Service  Discipline 

Nodal  Delay 

PSD 

1.03 

SLQD 

Serve  the  Longer 
Queue  but  not 
consecutive  "shared 
queue"  packets 

y  ■  1 

1.04 

y  *  2 

1.09  .... 

y  '  3 

1.08 

y  =  - 

i.lj 

ASD 

Alternate  Service: 
one  from  shared 
queue  and  x 
from  dedicated 
queue  but  do  not 
serve  shared 
queue  vhen  its 
queue  length  <  TH 

TP  *  0 

x  -  1 

1.11 

x  *  2 

1.12 

x  »  3 

i-.  05 

TH  *  1 

x  *  1 

1.09 

X  «  .? 

1.C5 

X  »  3 

1.07 

TH  *  2 

X  -  1 

1.00 

x  =  2 

1.05 

X  =  3 

1.04 

TH  *  3 

x  x  1 

1.06 

x  =  2 

1.0" 

x  =  7 

l.Ot 

RSD  Random 

1.13 

M7M/Z 

1.01 

M/M/1 

2.00 

3 . 2  Network  Delay  Performance 
3-2.1  Analysis 

In  this  section,  we  study  the  average 
network  delay  performance .  In  cur 
routing  strategy,  instead  of  treating 
each  communications  line  separately,  we 
focus  our  Interest  on  the  whcl-’  node. 
Including  all  Its  communications  links. 
As  a  result,  our  focus  is  on  nodal  delay 
as  opposed  to  link  delay  which  Is 
usually  studied. 


Let  us  assume  that  we  have  ari  expression 
for  nodal  delay  at  node  J,  which  Is 
denoted  by  T,.  The  average  ruir.tcr 
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packets  N(  ,  at  nodal  .!  can  th. re  t  <■ 

written  as  fi ,  =  a-T.,  win.:-..  »-  is  \  h>- 
-J.  “J- 

total  traffic  (  fr  *r.  t  :tb  ••x*  »i  anJ 
internal  traffl:  sources  i  .a:  de  J.  The 

average  number  of  packets  it.  th-  network 
(N)  Is  therefore  the  sum  of  the  average 
number  of  packets  at  each  nude.  That  Is, 
N  *  Z  N  ,  .  The  average  network 

X-  NODES  J- 

delay  (N)  Is  therefore 


T  = 


Z  X  — T1 
NODES  *■ 


(1) 


y 

where  y  1  Z  '  .  Is  the  total  external 
J.,k  ii 

network  traffic  demand  and  y4  Is  the 

offered  rate  of  traffic  between  node  pair 
(J,k).  Equation  (1)  Is  therefore  a 
general  expression  for  the  average  network 
delay . 

To  analyze  the  network  delay  we  make  the 
following  assumptions.  We  assume  the  node 
models  used  previously.  The  traffic  Is 
found  by  applying  the  routing  Implied  by 
the  preassigned  paths.  We  can  then 
calculate  the  rate  In  packets  per  second 
of  arrivals  to  the  shared  and  dedicated 
queues.  In  our  network  analysis  we 
assume  these  arrivals  to  be  a  Poisson 
process  and  compute  the  nodal  delay  for 
each  node  using  the  techniques  described 
above.  Equation  (1)  gives  us  the  network 
delay.  Below  we  compare  this  technique 
with  the  results  of  two  simulations.  The 
first  dispenses  wit I  ‘he  Poisson  assump¬ 
tion  and  accurately  models  arrivals  to 
Intermediate  nodes  In  paths.  To  make 
the  simulation  simpler,  packet  lengths 
are  Independently  reassigned  at  each  node 
In  a  path.  The  second  simulation 
dispenses  with  this  approximation  and 
retains  the  Initial  pacKet  iength  for  a 
throughput  Its  path. 

3.2.2  Simulation 

In  this  section,  we  will  focus  our  discus¬ 
sion  on  two  PL/1  simulations  which  were 
written  to  evaluate  the  network  delay 
performance.  The  Interarrival  time  of 
arrivals  to  network  ndoes  Is  assumed  to 
be  exponentially  distributed.  The 
packet  length  Is  assumed  to  be 
exponentially  dlstrlouted.  All  servers 
are  assumed  to  have  the  same  service 
capacity  (for  convenience  as  unity). 
Therefore,  the  average  service  rate  of 
each  server  Is  the  same.  In  most  of  our 
simulation  studies,  we  have  used 
p*0.875,  50  percent  shared  traffic  at 
the  ndoes,  and  a  symmetrical  network. 

Other  traffic  ut  1 1  is  at  I  ns,  ncd.il  tnfflo 
nixes,  and  asymmetric  network  traffl  .- 


demand.,  ire  also  studied  but  not 
extensively  is  tne  ..t  ae. 

The  first  simulation  all  -w.:  t  s  • 
lengths  to  be  reassigned  at  the 
node.  It  does  not  make  any  ass^mt  •  1 
abcut  the  output  process  from  a  sej-vr. 

A  server  serves  Its  competing  queues 
according  to  a  pre-spec  1  fled  service 
discipline.  After  the  server  finishes 
service  of  a  packet,  the  packet  el* 
exits  from  the  network  or  Joins  the  m' 
queue  for  more  service.  Onto  1*  arris • 
at  the  next  queue  (In  the  next  r.,ic  , 
new  packet  length  from  the  same  ex: 
nentlal  distribution  Is  assigned  •  It. 

It  then  Is  put  on  the  appropriate  pje... 
and  waits  for  service.  We  comp-are 
results  of  our  network  simulation  witr. 
our  analysis  based  upon  nodal  simuiatl  :. 
results.  In  the  latter,  we  assume  all 
Inputs  to  the  node  to  be  Poisson. 

The  second  simulator  simulates  the  true 
service  situation  in  an  operating 
packet  network.  That  Is,  a  particular- 
packet  when  served  by  the  next  server- 
will  require  the  same  amount  of  service 
time.  When  the  results  obtained  from 
the  second  simulation  are  compared  wit: 
our  analysis  by  using  cur  nodal  simuiatl  : 
results,  we  can  evaluate  the  effect  cf 
both  the  Poisson  output  stream  and  tne 
packet  length  reassignment  assumptions. 

3 . 2 . 2 . 3  The  First  Network  Simulation 

3. 2. 2. 1.1  Local  Service  Discipline.. 

All  four  service  disciplines  are  studied. 
The  simulation  results  in  Tabie  3  and  \r 
previous  work  (Reference  [5])  reveals 
the  following: 

1.  The  average  network  delay  from  tire 
network  simulation  Is  s Ignl f leant iv 
closer  to  that  of  the  M/M/2  nodal 
than  to  the  M/M/1  nodal  model. 

2.  The  difference  between  the  analyt¬ 
ical  results  using  the  nodal  mod'*, 
and  those  obtained  from  the  netw.:-.-: 
simulation,  for  the  PCD,  increase;- 
as  the  percentage  of  shared  traffl: 
increases  at  a  node. 

3.  With  the  same  traffic  mix  at  a  n.  :■ 
the  difference  between  the  netw-.-- 
delay  found  analytically  and  th  v- 
obtained  from  the  network  at mu  l  r  it 
for  the  PSD  Increases  as  the  traffic 
load  Increases. 

4 .  The  network  delay  obtained  from  *  h- 
network  simulation  for  the  A  1 1 »■  rr.  it •  ■ 
Service  Discipline  (ASP)  and  *  he 
Serve  the  Longer  Ueuo  11.:  if  .in- 
(SLQD)  are  lower  than  tnat  f  '  !.•• 


-C  • 


•  • 
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Table  3 


Network 


S  1  mu  1  at.  1  on 
packet/:;.-.,-,  I  =  }  .  9 


Hesu  1  L  s 
paekels/sec  , 


l*  1 1  "ll  1  ' ' •  0  pack.- 1  •;/ , 


-  ■ ; .  H  1 , ; 


♦ 


Service  Discipline 

Network  Dela 

y  ( seconds  ) 

X  Difference* 

Nodal  Model 

Simulation 

PSD 

2.06 

2.7S 

33* 

ASD 

Alternate  Service: 
one  from  shared 
queue  and  n 
from  dedicated 
queue  but  do 
not  serve 
shared  queue 
when  Its  queue 
length  <_  TH 

TH  -  0 

n  -  1 

2.22 

2.05 

20* 

n  -  2 

2.2k 

2751 

12X 

n  ■  3 

2.10 

2.5S 

7JJ 

TH  -  1 

n  *  1 

27TB 

2.50 

isi 

n  *  2 

2 . 10 

2 . 50 

195 

n  -  3 

2.1i4 

2.51 

id* 

TH  -  2 

n  ■  1 

2.16 

2.59 

2ox 

n  »  2 

2.10 

2.52 

20? 

n  -  3 

2. OB 

2.62 

26J 

TH  -  3 

n  «  1 

2.12 

2.51 

Tffx 

n  •  2 

2.14 

2. 49 

17* 

n  -  3 

2.10 

2.55 

2lX 

SLQD 

Serve  the  Longer 
queue  but  not  y 
consecutive 
shared  queue 
packets 

V  -  1 

2.08 

2.45 

18* 

y  -  2 

T7TB 

2 . 58 

19X 

y  ■  3 

2.16 

2.67 

24X 

y  *“ 

05 

2.67 

19X 

RSD 

2.26 

2.71 

20X 

M/M/2 

2.02 

_ 

_ 

M/M/1 

Oo 

- 

- 

■i 


1 


4 


. 


-M 


•  f  D1frerence  -  TNETWORK  (Simulation)  -  TNETWORK  (Nodal  Model) 

TNETWORK  (Nodal  Model) 


Table  4  show  our  results.  We  make  the 
following  observations: 

1.  The  network  delay  of  the  best 
look  ahead  scheme  has  an  Improve¬ 
ment  of  about  11  percent  over  that 
of  the  best  local  scheme. 

2.  The  look  ahead  service  scheme  per¬ 
forms  consistently  better  than  the 
local  service  disciplines. 

3.  When  both  servers  are  empty.  It  is 
better  to  serve  an  arrival  to  the 
shared  queue  on  the  basis  of 
downstream  Information,  than  to  do 
It  randomly. 

3. 2. 2. 2  The  Second  Network  Simulation  - 
No  Packet  Length  Reassignment 

The  analytically  obtained  network  delay 
and  our  first  simulation  differ  not 
only  In  the  PSD  but  also  In  the  ASD  and 
the  SLQD  where  the  output  stream  should 
be  somewhat  smoother.  Furthermore,  as 
was  demonstrated  by  Kle'nrock  (Reference 
[5]),  the  packet  length  reassignment 
assumption  does  not  work  well  in  a  small 
network  like  our  four  node  network  with 
very  few  traffic  streams  flowing  at  each 


node.  This  motivated  us  to  build  the 
second  network  simulator  whereby  the 
packet  length  reassignment  assumption 
Is  not  made. 

3. 2. 2. 2,1  Dynamic  Routing  Scheme  With 

Local  Node  Service  Disciplines 

We  have  used  our  second  simulation  to 
evaluate  the  network  delay  performance 
under  the  various  service  disciplines 
(PSD,  ASD,  SLQD,  and  RSD)  discussed 
previously.  Each  node,  operates 
Independently,  serves  Its  queues  accord¬ 
ing  to  the  pre-spec  1 f led  service 
discipline.  Traffic  Information  at  each 
node  is  not  exchanged  between  adjacent 
nodes.  Therefore,  only  "local"  node 
service  disciplines  are  discussed  In  this 
section.  We  have  also  examined  the  aver¬ 
age  network  delay  under  two  fixed  routing 
strategies  (Reference  (5j). 

Table  5  tabulates  our  results.  We  make 
the  following  observations: 

1.  Removing  the  packet  length  assump¬ 
tion  greatly  Improves  the  accuracy 
of  our  analytic  nodal  method. 


■ 
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Tabled  Network  Simulation  Results  Or  A  Look  Ahead  Routing  Strategy 

(Xll'Xl?=  0  Packe‘/3ec  •  i.j'3.5  packets/ sec  ,  packet  s/sec ,  p*O.S'f.) 


Threshold 

Network  Delay  (seconds) 

x  9  Q  -v  ® 
x2  W21 

x2  ’  Q2l" 

X.  m  0  +  i  0  1# 

2  W21  2  w  23 

0  (PSD) 

2.75 

2.7? 

_ 

1 

2-33 

2.2o 

2.27 

2 

2  .  36 

2.29 

2.19 

_ 

2.37 

2.32 

2.26 

5 

2.55 

2.42 

2 .  <  2 

5 

- 

- 

2.35 

- 

- 

2.41 

*  When  both  S^  and  S^2  are  not  busy  and  there  Is  an  arrival  to  Q^,  this 
packet  will  be  served  by  either  or  S12- 


•*  Under  the  same  situation  as  above,  x2  will  be  compared  with  x^  (the  same 
parameter  at  node  4). 

The  packet  in  will  be  served  by  (of  S12)  if 

x2  <  (or  x2>x|() 


2.  The  packet  length  reassignment 
assumption  has  more  effect  on  the 
network  delay  performance  for  the 
PSD  than  any  other  service 
disciplines . 

3.  The  PSD  Is  the  best  "local"  service 
disciplines  among  all  other  local 
service  disciplines. 

The  network  delay  of  our  dynamic 
routing  scheme  under  any  one  of 
the  service  disciplines  is 
significantly  better  than  either 
of  the  benchmark  fixed  routing 
strategies . 

3 . 2 . 2 . 2 . 2  Dynamic  Routing  With  Look 
Ahead  Service  Scheme 

The  "clock  driven"  network  simulation 
keeps  track  of  how  much  time  a  packet 
requires  from  a  server.  This  enables 
a  server  to  know  how  much  work  (In  terms 
of  service  time)  the  next  server  has  at 
any  given  Instant  of  time.  A  server  can 
make  an  accurate  decision  to  keep  the 
network  from  losing  Its  service  capacity. 
A  server,  upon  completing  a  service, 
compares  the  total  amount  of  work  at  the 
next  server  (the  remaining  service  time 
for  the  packet  still  in  service  at  the 
next  server,  the  total  service  time  In 
the  dedicated  queue  at  the  next  node, 
the  total  service  time  in  the  shared 
queue  at  the  next  node,  and  a  threshold 
parameter  which  Incorporates  the  fact 


that  the  shared  queue  packets  are  served 
by  both  servers  at  the  next  node  and  there 
are  new  arrivals  to  the  next  node  during 
the  service  time)  to  the  sum  of  the  service 
times  of  the  first  packet  In  each  of  Its 
competing  (shared  and  dedicated)  queues. 
The  server  serves  the  shared  queue  packet 
if  the  total  amount  of  work  at  the  next 
server  is  less  than  the  sum  of  the  service 
times  of  the  first  packet  In  each  of  its 
competing  queue.  Should  the  server  not 
serve  the  shared  queue  packet  Immediately, 
the  next  server  could  be  idle  for  a  period 
of  time  equal  to  the  difference  between 
these  two  quantities.  The  server  would 
otherwise  serve  the  dedicated  queue 
packet.  This  look  ahead  service  scheme 
would  be  Ideal  in  our  dynamic  routing  If 
the  threshold  parameter  could  be 
correctly  set.  The  network  delay  with 
both  exponentially  distributed  and 
constant  packet  lengths  are  evaluated. 

The  following  observations  can  be  made 
(see  Table  6): 

1.  The  PSD  is  equivalent  to  setting 
the  threshold  value  to  negative 
infinity . 

2.  The  network  delay  Is  insensitive 
to  the  threshold  values  near  its 
"optimal"  setting. 

3.  The  difference  between  the  network 
delay  of  the  "best"  local  service 
discipline  (the  PSD)  and  the  look 
ahead  service  scheme  wltn  the  "te..t" 


choice  of  the  ‘hi—.  2d  value  1 s 
about  10  percent . 

Table  5  Net  w  -rl:  ir.ul  at  Iti  Results  h  >r  Different  Service  Discipline:;  - 
Local  Dec  Is  lor. 

^11=  i.2~°  Pac'<et/3econcl ,  An  =  3.S  packets/second,  Packets/3ecor'3 

p»0 . 875 ) 


Servl ce 

Disc  It  1 lne 

Network  Delav  (seconds) 

rTrst 

.7  imu  lat  1  on 

Second 

Z  1  r.u  1  'it  i  on 

Analvt  Ic 

POD 

2  .  V  b 

2  1  'j 

.  Oh 

ACD 

2.32 

j .  ti  2 

SLQD 

2. 4;, 

2.32 

2.03 

RSD 

.  2.73 

2.^6 

2.26 

Fixed  Routing 

#1 

3.84 

— 

Fixed  Routing 

» 2 

— 

3.26 

— 

Table  6  Network  Simulation  Results  For  A  Look  Ahead  Routing  Strategy 

^11  =  *12=<^  Pac*cet//seccnd *  3^3=3-5  packets/second,  un~yi2=i<  packets/ 
second,  p=0.875 


Routing  Strategy 

Network  Delav  (seconds) 

Exponential  Packet 
Lengt  r.s 

Constant  Packet 
Lengths 

- 

PSD 

*5  i  £ 

1.23 

LOC  Sl  X 

RSD 

2 . 56 

1.31 

-0.25 

1 . 36 

1.17 

0.00 

-  •  9Jt . 

1  .  14 

0.05 

.1 . 9  6 

_ 

Look 

0.10 

1 . 9* 

-- 

Ahead 

0.15 

1.95 

— 

0.25 

1.96 

1.20 

0.50 

2.01 

1.27 

0.75 

2.07 

1.33 

1.00 

2.13 

1.38 

3. 2. 2. 2. 3  Asymmetric  Traffic  Demands  And 
Service  Rates 

We  have  Illustrated  significant  Improve¬ 
ments  In  network  delay  using  our  adaptive 
techniques  for  the  four  node  network. 
However,  only  symmetric  traffic  leads 
have  been  studied  so  far.  In  order  to 
demonstrate  the  robustness  of  our 
routing  strategy,  a  certain  degree  of 
asymmetry  was  !ntr  dared  into  the  network. 
We  have  evaluated  too  network  delay  of 
the  four  node  syrmet r 1c  network  w 1  ‘  h 
asymmetric  tr:fflc  m !  with  different 
link  capacities. 

We  make  the  following  observations  from 
the  results  of  Tables  7  and 

The  RSD  per  f  res  t  he  worst  am  rig  a :  1 
service  'll::  • ;  1  !:.••.■  ;  *  h-  t'.T  4 

the  b>T,‘  ar  •;.*  1 1  1  r  j’  r.g  .51  ra‘  -vie  ; 


based  on  local  traffic  information;  the 
"best"  look  ahead  scheme  performs 
better  than  the  PSD  in  terms  of  aver¬ 
age  network  delay;  and  the  threshold 
value  set  In  the  look  ahead  scheme  Is 
insens'  ! /e  to  the  network  delay 
performs  ce  over  a  wide  range. 

4 .  CONCLUSION 

We  have  studied  a  dynamic  routing  scheme 
for  packet  switching  :,•••* 'res.  Its  not  - 
work  delay  per  format:  .••••  is  cut  or!  r  t 
that  of  fixed  routing.  We  nave  tls 
developed  ar.d  evaluated  an  analytic 
technique  fer  evaluating  alartive  rcut  ir.g 
performance.  Extensive  Simula*,  lens  con¬ 
firm  these  results.  A  locally  adaptive 
scheme  perform  almsst  as  we  1 1  is  one 
that,  has  more  lnforr.it  1  >n. 
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Table  7  Network  Delay  Wltti  Asymmetric  Traffic  Demand a 

(A  12“2  packets/sec,  Aj.,-1  packet/sec,  Ajj*’’  packets/sec, 

^lj"^  packets/sec,  p»0.875) 


Table  8  Network  Delay  With  Asymmetric  Service  Rates 

(*il“*i2*°  Pacicets/sec  •  ^ 1 3m3 - 5  packets/sec,  mu ix“p22“u 33-lJ,*2u“  -* 
packets/sec,  A'i2“,i2l“M32"li4l"^  packets /sec,  p-0.875 
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A.  Studies  in  Adaptive  Routing 
A.  1 .  Stable  Routing  Patterns 

As  part  of  a  general  investigation  in  the  area  of  dynamic  routing 
in  computer  communication  networks  we  consider  the  problem  of  finding 
stable  global  routing  patterns.  Specifically,  we  have  proposed  [1]  a 
two-level  routing  procedure  where  the  lower  (local)  level  adapts  dy¬ 
namically  to  instantaneous  variations  in  the  congestion  of  the  network  in 
the  immediate  vicinity  of  each  node  and  the  higher  (global)  level  en¬ 
sures  stability  by  keeping  the  average  load  across  the  entire  network  in 
some  sense  globally  balanced.  We  now  consider  this  latter  problem. 

We  consider  as  a  measure  of  global  balance  the  utilization  of  the 
most  heavily  utilized  network  element  (node  or  link)  and  seek  to  mini¬ 
mize  this  quantity.  For  simplicity,  we  will  speak  only  of  link  utiliza¬ 
tions.  (Node  utilizations  can  be  included  in  a  straightforward  manner.) 
Thus,  we  are  given  a  network  containing  N  nodes  and  L  (directed) 
links.  Each  link,  1,  has  a  capacity  Cy  There  are  (directed)  require¬ 
ments  r •  j  between  nodes  i  and  j.  Each  r^  is  satisfied  by  routing  it  on 
one  or  more  paths  P^jj  from  i  to  j.  (In  the  two  level  adaptive  routing 

scheme,  these  paths  (or  links  within  them)  will  be  the  alternatives  open 

( K ) 

to  each  requirement.)  A  routing  pattern  is  defined  by  the  paths  P  ^ 

and  the  fraction,  f^.}  of  r..  using  each  The  utilization  of  each 

i]  i)  i) 

link  is  equdl  to  the  total  flow  (sum  of  fractions  of  requirements)  on  the 
link  divided  by  its  capacity. 

The  maximally  utilized  link  is  in  a  sense  the  most  vulnerable  part 
of  the  network  and  the  most  likely  cause  for  the  dynamic  routing  mech¬ 
anism  to  break  down  (e.g.  loop)  due  to  congestion.  By  minimizing  the 
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utilization  of  the  maximally  utilized  link  we  seek  to  minimize  the  chance 
of  congestion  leading  to  such  failure.  It  should  be  noted  that  we  are 
dealing  with  the  global  level  of  the  routing  procedure  here  and  as  such 
consider  only  long  term  average  utilizations,  not  instantaneous  meas¬ 
ures.  The  local  level  of  the  routing  procedure  concerns  itself  with 
making  decisions  instantaneously  on  the  basis  of  the  local  state  of  the 
network  in  the  vicinity  of  a  node.  Even  within  the  constraints  of  a 
given  P^.^and  defined  by  the  global  strategy,  the  local  level  has 

considerable  flexibility  in  choosing  when  to  use  each  route  and  as  such 
can  obtain  substantial  reductions  in  delay  when  compared  with  static 
routing  policies. 

We  now  turn  to  the  problem  of  actually  finding  the  optimal  global 
routing  pattern  as  defined  above.  The  technique  resembles  the  Flow 
Deviation  Method  of  Cantor  and  Gerla  [2]  and  will  be  described  in 
similar  terms.  Cantor  and  Gerla  sought  to  minimize  the  average  delay 
whereas  we  seek  to  minimize  the  maximum  utilization  of  a  link.  Both 
functions  are  convex  functions  over  a  convex  region  and  as  such,  the 
same  type  of  procedure  can  be  proven  to  yield  an  optimal  routing 
pat'ern.  The  proof  is  given  in  (2). 

Our  function  is  only  piecewise  differentiable  and  as  such,  the 
gradient  search  used  in  [2]  is  not  appropriate.  In  fact,  the  alternative 
described  here  takes  the  special  nature  of  the  objective  function,  a 
minimum  of  linear  functions,  into  account  and  not  only  overcomes  its 
non-differentiability  but  also  is  considerably  more  efficient  and  easier 
to  implement  than  a  gradient  search. 

We  now  give  an  outline  of  the  optimization  procedure.  A  high 
level  flowchart  of  this  procedure  is  given  in  Figure  1.  As  mentioned 


above,  at  this  level  the  procedure  is  almost  identical  to  the  Flow  De¬ 
viation  Algorithm.  The  key  difference,  which  is  only  evident  in  a  more 
detailed  description,  is  how  the  optimal  superposition  of  flows  is  found. 

We  define  the  length  of  a  link  to  be  its  utilization.  Initially,  we 

set  all  link  lengths  to  0.  We  define  the  length  of  a  path  to  be  the 

length  of  the  longest  link  in  the  path  plus  a  small  constant  times  the 
number  of  links  in  the  path.  This  latter  term  is  added  to  break  ties 
among  paths  with  equally  utilized  links  in  favor  of  a  path  with  the 

smallest  number  of  links.  Note  that  this  definition  of  path  length  is 
different  from  the  conventional  one  but  it  serves  our  purpose.  Short¬ 
est  paths  using  this  metric  are  computable  using  conventional  shortest 
path  algorithms. 

The  routing  pattern  found  at  each  stage  in  the  optimization  proce¬ 
dure  is  a  single  shortest  path  for  each  requirement  r...  (In  general, 
this  path  is  not  unique,  but  this  poses  no  problem.)  A  flow  pattern  is 
defined  as  the  total  flow  in  each  link  and  is  found  by  loading  the 

requirements  onto  the  links  specified  in  the  current  flow  patterns. 

An  optimal  superposition  of  the  current  flow  pattern  with  all  pre¬ 
vious  flow  patterns  is  then  found.  This  is  the  key  step  in  the  proce¬ 
dure  and  is  done  by  finding  the  value  of  A  between  0  and  1  which 
minimizes  V,  the  maximum  lini.  utilization,  where  A  represents  the  frac¬ 
tion  of  all  previous  flow  patterns  used.  The  new  optimal  superposition 
of  flows  is  then  A  times  the  previous  superposition  plus  ( 1  -A )  times  the 
current  flow  pattern.  A  new  superposition,  and  hence  link  utilizations, 
is  then  obtained.  This  in  turn  yields  a  new  value  for  V(K)  and  the 
link  lengths.  We  can  now  start  another  iteration.  If,  however,  no 
improvement  in  V(K)  has  been  observed,  the  iteration  has  converged 


and  we  terminate  the  procedure  with  an  optimal  flow  pattern.  By 

( K ) 

saving  the  routing  patterns  and  A  ,  the  values  of  A  for  each  K,  the 
optimal  routing  pattern  can  be  obtained.  In  particular,  if  P  j.  is  the 
(i,j)  path  first  used  in  the  routing  pattern  in  iteration  K  then  the 
fraction  of  commodity  (i,j)  using  Pv  ^ is 

(1-A(K))  n  A(^ 
j=K+l 

where  M  is  the  number  of  iterations  and  the  product  is  defined  equal  to 
1  for  K  =  M. 

We  now  turn  to  the  problem  of  how  to  find  the  optimal  super¬ 
position  of  flows.  Consider  two  flow  patterns,  and  r^2\  Each 

flow  pattern  assigns  a  flow  to  each  link.  Thus  f^  and  f^  are  the 
flows  assigned  to  link  (i,j)  in  and  F^\  respectively.  For  any  A 

between  0  and  1,  the  flow  assigned  to  link  (i,j)  bv  superposing  AF^  ' 
and  (l-A)F^  is  then 

Af(1)+  (1-A)f(2) 
il  ij 

which  equals 

f(2)  +  A(f(1.)-  f(2)) 
ij  i]  i) 

which  is  a  linear  function  of  A.  Dividing  by  Cj,  to  obtain  utilizations , 
there  will  be,  in  general,  a  different  function,  a^  +  bjA  for  each  link  1. 
(We  will  for  simplicity  refer  to  links  by  a  single  index,  1,  rather  than 
endpoints  ( i ,  j ; .  ) 

We  seek  the  value  of  A  which  minimizes  the  maximum  of  these 
functions  over  all  1.  Several  simple  observations  allow  us  to  find  this 
value  of  A  in  an  efficient  and  straightforward  manner.  First,  if  for  two 


links,  I  and  m,  a,  >  a„  and  b,  >  b„  then  link  I  is  said  to  dominate  link 

m  and  link  m  can  be  ignored  as  it  clearly  does  not  participate  in  the 

maximum  since  a,  +  b,A  >  a  +  b  A  for  all  values  ol  A.  Indeed,  it  a.  + 

b|\  >  a  e  bmA  for  all  A  between  0  and  1  then  link  m  may  be  ignored. 

Note  that  this  latter  condition  is  not  equivalent  to  the  former,  for 

example  if  a,  =  10,  b,  =  0,  a  =  2,  and  b  =3. 

1  l  m  m 

We  can  then  arrange  the  links  1  in  descending  order  of  aj  and 
examine  the  b,.  Any  link  m  for  which  b  does  not  exceed  b,,  where  1 
is  the  predecessor  of  m  in  the  order,  can  be  eliminated.  We  now  have 
an  ordering  of  links  which  is  descending  in  aj  and  ascending  in  bj.  We 
then  compute,  for  each  adjacent  pair  of  links  1  and  m,  the  value  Aj  for 

which  aj  +  b|Aj  =  am  +  b  Aj.  The  Aj  should  form  an  ascending  se¬ 

quence.  A  value  of  A^  which  is  less  than  the  value  of  its  predecessor 
in  the  sequence  corresponds  to  a  link  1  which  can  be  eliminated  from 
further  consideration.  In  this  case,  link  1  is  eliminated  and  m  has  a 
new  predecessor.  The  Ap  is  then  recomputed  for  link  p  the  new  pre¬ 
decessor  of  m  and  this  process  is  repeated.  For  link  n,  the  last  link 

in  the  sequence,  A  =  1 . 

We  now  compute  for  each  remaining  link  I  Vj  =  dj  +  b|Aj  and  select 
the  minimum  of  these  values.  The  resulting  Aj  and  Vj  are  the  desired 
values  yielding  the  optimal  superposition. 

1'his  entire  process  is  illustrated  in  Tigure  2.  The  links  have 
been  sorted  so  that  the  aj  are  descending.  I.inks  with  nonascending  bj 
have  already  been  eliminated.  Thus  the  b^  form  an  ascending  sequence. 
This  is  evident  in  Figure  2  by  the  fact  that  the  lines  term  a  sequence 
increasing  in  slope.  The  intersection  of  lines  1  and  2  (i.e.  the  lines 
starting  at  dj  and  a.,)  defines  Aj  .  Similarly,  the  intersec  tion  ol  lines 
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2  and  3  defines  and  ^  >  thus  ^ar  no  **ne  *s  dominated.  The 

intersection  of  lines  3  and  4  takes  place  between  A  =  1  so  line  4  is 
dominated  by  line  3  and  line  4  is  thus  eliminated  from  further  consider¬ 
ation.  The  intersection  of  lines  3  and  5  defines  a  value  of  A^  (dotted 
line),  but  when  Ag  is  computed  we  find  it  to  be  less  than  A^ .  So,  line 
5  is  dominated  by  line  6  and  removed  from  further  consideration.  A^  is 
then  recomputed  from  the  intersection  of  lines  3  and  6.  A.g  is  computed 
from  the  intersection  of  lines  6  and  7.  Finally  A?  =  1.  This  leaves  us 
with  A,,  A2,  A3,  Ag,  and  A?  (also  AQ  =  0).  We  search  among  the  cor¬ 
responding  Vj  and  find  v^  is  minimum.  It  and  A^  define  the  desired 
superposition. 

The  entire  optimization  process  is  illustrated  in  Figures  3,  4,  and 
5.  The  network  consisting  of  3  nodes,  6  links  and  6  requirements  is 
shown  in  Figure  3.  For  simplicity,  we  assume  symmetric  requirements 
and  link  capacities.  We  can  thus  assume  a  symmetric  solution,  i.e., 
routes  for  r-  the  reverse  of  routes  for  r-.  and  equal  utilization  of  each 
link  in  both  directions.  This  allows  us  to  only  consider  3  links  and  3 
requirements  in  the  example.  This  is  done  to  simplify  the  example. 
The  actual  procedure  works  with  directed  links  and  requirements.  It 
can  also  be  used  with  undirected  links  and  requirements  but  such  a 
situation  is  rarely  physically  meaningful. 

Initially,  all  requirements  are  routed  directly  since  the  initial 
shortest  paths  by  our  definition  would  be  the  paths  with  the  minimum 
number  of  links.  This  is  illustrated  in  Figure  4a.  The  link  lengths 
are  then  recomputed  --  link  1  has  a  utilization  of  .3  and  hence  a  length 
of  .3,  etc.  The  shortest  paths  are  then  recomputed  and  are  shown  in 
f  igure  4b.  Note  that  the  shortest  path  from  B  to  C  is  now  B-A-C. 
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Fhe  requirements  are  loaded  onto  these  paths.  The  flow  pattern  is 
shown  in  Figure  4c. 

Mow  a  superposition  of  the  flow'  patterns  in  Figures  4a  and  4c  is 
done.  Figure  5  illustrates  the  dynamics  of  this.  Note  that  r^,  domi¬ 
nates  r^  and  that  the  optimal  A  is  .9  and  v  =  4.5.  Figure  4d  shows 
the  resultant  routing  pattern  formed  by  using  the  first  routes  for  90% 
of  the  traffic  and  the  second  routes  for  the  remaining  10%.  Figure  4e 
shows  the  flow  pattern  resulting  from  the  superposition.  Note  that  the 
maximum  utilization  is  .45  (in  links  (A,C)  and  (B,C))  which  is  less 
than  the  maximum  in  either  of  the  patterns  in  Figures  4a  and  4c. 

We  now  recompute  the  link  lengths  and  the  shortest  paths.  The 
resultant  routes  are  the  same  as  in  Figure  4a.  An  optimal  superposition 
between  this  flow  pattern  and  the  one  in  Figure  4e  is  then  done.  The 
optimal  value  of  A  is  1 ,  no  improvement  in  v  is  found  and  we  conclude 
that  the  routing  and  flowr  pattern  in  Figures  4d  and  4e  are  optimal. 
Note  that  the  links  (A,C)  and  (B,C)  are  both  maximally  utilized.  They 
form  a  cut  which  is  analogous  to  the  saturated  cut  in  Gerla's  Cut  Satu¬ 
ration  Method.  (The  existence  of  such  a  cut  is  a  necessary  condition 
for  the  optimality  of  a  flow  pattern.) 

We  thus  have  developed  a  simple  and  efficient  algorithm  for  obtain¬ 
ing  stable  flow  patterns  for  use  globally  as  the  higher  level  in  our  2 
level  adaptive  routing  procedure.  In  the  coming  months  we  hope  to 
implement  this  pr  :edure  and  experiment  with  it. 

In  an  allied  study  we  investigated  a  pattern  for  placing  virtual 
calls  on  a  network.  A  simulation  program  was  written  to  directly  ob¬ 
serve  the  dynamic  performance  of  an  algorithm  which  loads  calls  on 
alternate  routes  according  to  the  following  algorithm: 
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1.  Load  each  incoming  call  onto  the  route  currently  carrying  the 
smallest  number  of  calls.  (The  number  of  calls  carried  by  a 
route  is  defined  for  the  purposes  of  this  algorithm  to  be  the 
number  of  calls  on  the  first  link  in  the  route.) 

2.  If  there  is  a  tie  among  several  routes  in  a  set,  S,  select 
route  i  with  probability  P- ( S ) . 

The  simulation  was  written  to  provide  us  with  a  first  glimpse  of 
the  dynamic  performance  of  such  a  procedure  as  a  guide  for  further 
research  in  this  area.  We  thus  wanted  to  keep  it  as  simple  as  possible 
and  considered  a  3  node  6  link  network  as  shown  in  Figure  5a  with 
symmetric  requirements.  The  program  can  easily  be  expanded  to  con¬ 
sider  more  general  cases  but  we  chose  this  simple  one  initially  in  order 
not  to  obscure  the  basic  results. 

Calls  arrive  at  each  node  at  a  rate  k  (Poisson)  and  are  served  at 
rate  p  (exponential)  by  the  links,  i.e.,  have  exponential  duration  with 
average  length  1/p.  Each  call  has  a  choice  of  a  1  hop  path  or  a  2  hop 
path.  A  call  arriving  at  a  node  is  equally  likely  to  be  destined  for 
either  other  node.  Thus,  there  is  total  symmetry  in  the  system.  It 
should  be  noted  that  a  call  taking  a  2  hop  path  occupies  2  links  but 
remains  in  the  system  for  time  1/p  on  average  (not  2/p). 

The  simulation  is  straightforward.  Call  arrivals  are  generated 
randomly  and  arriving  calls  are  routed  according  to  the  algorithm  given 
above.  The  number  of  calls  taking  the  1-hop  and  2-hop  routes  were 
recorded  for  each  run.  A  parameter,  a,  determined  the  probability  of 
taking  the  1-hop  route  when  there  was  a  tie  between  the  2  routes  ( a  = 
Prob  {using  the  1-hop  route  in  case  of  a  tie}). 


6 


4 


lor  a  =  .5  the1  traction  of  calls  taking  the  1-hop  route  was,  not. 
surprisingly,  very  close  to  f'or  a  =  0,  however,  the  fraction  varied, 
lor  \  p  -  1,  80",  of  the  calls  took  the  2-hop  path,  lor  A/p  =  10,  66",  ot 
the  calls  took  the  2-hop  path.  Tor  A/p  =  50,  65",  of  the  calls  took  the 
2-hop  path.  lor  0=1,  the  results  (fraction  on  1-hop  versus  fraction 
on  2-hop  paths)  reversed  relative  to  the  results  for  u  =  0. 

We  thus  conclude  that  we  have  some  control  but  not  total  control 
over  the  routing  via  a  which  only  operates  during  a  tie.  The  control 
gets  greater  for  systems  with  a  smaller  number  of  calls  in  progress,  as 
evidenced  by  the  results  for  smaller  values  of  A/p  (which  is  directly 
related  to  the  number  of  calls  in  the  system).  Observations  of  the 
number  of  calls  in  the  system  at  various  points  in  a  simulation  run  led 
to  the  conclusion  that  the  system  is  stable,  i.e.,  that  the  link  loads 
reach  a  stable  level  and  remain  close  to  that  point  and  close  to  one 
another. 
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ABSTRACT 

This  paper  considers  the  problem  of  routing  traffic  with  many  desti¬ 
nations  through  a  congested  network.  The  problem  is  formulated  as  a 
very  general  dynamic  multicommodity  flow  problem.  The  optimal  control 
problem  which  results  is  a  very  difficult  one  to  solve  analytically  (even  for 
open  loop  solutions)  because  of  the  presence  of  Doth  state  variable  con¬ 
straints  and  delays  in  the  system  dynamics. 

The  difficulties  inherent  in  the  centralized  problem  can  be  avoided  by 
using  a  decentralized  approach  to  obtain  dynamic  routing  strategies.  Some 
of  the  results  obtained  using  this  approach  are  presented. 

INTRODUCTION 

Consider  a  network  made  up  of  N  nodes  connected  by  L  directed  links 
along  which  the  rate  of  traffic  flow  can  be  controlled.  M  of  the  nodes  are 
destination  nodes  toward  which  varying  amounts  of  traffic  must  be  routed 
in  accordance  to  the  traffic  demand. 

When  the  traffic  demand  exceeds  the  capacity  of.  the  network  to  meet 
this  demand  for  service,  congestion  builds  in  the  network.  In  the  model 
used  here  (first  suggested  by  Gazis^)  the  congested  network  is  viewed 
as  a  store  and  forward  network  in  which  all  delays  due  to  congestion  are 
treated  by  assuming  that  queues  can  build  inside  the  network  at  the 
network  nodes.  Traffic  is  assumed  to  move  between  nodes  at  a  constant 
velocity  (this  corresponds  a  fixed  travel  time  between  nodes)  and  is  stored 
at  the  nodes  in  queues  until  it  can  proceed  onward  toward  its  destination. 

The  degree  of  congestion  is  measured  by  the  aggregate  of  all  queues 
in  the  network.  The  problem  considered  here  is  that  which  occurs  after  a 
"rush  hour"  or  period  of  heavy  demand  when  the  demand  has  fallen  below 
the  capacity  but  congestion  is  still  present. 

Using  the  store  and  forward  model  the  queue  dynamics  can  be  expressed 


*The  research  was  supported  by  the  National  Science  Foundation  under 
grant  ENG-77- 14898 ,  and  partially  by  NATO  grant  $1425. 


1. 


N 


k=l 


m 


nk 


N 

(t)  + 
1=1 


m . „  m , 
u.  (t-s.  ) 

in  in 


i  1  ) 

for  n  =  l . 

m=l ,  .  .  , M 
ana  n  -  m. 


where  qnm  denotes  a  continuous  state  variable  which  approximates  the 

length  of  the  traffic  queue  at  node  n  having  node  m  as  its  destination; 

u  ,m  denotes  the  rate  of  traffic  flow  to  destination  m  on  link  (nk);  r  m 
n  k  n 

is  the  traffic  demand  entering  at  n  with  destination  m  and  Sjnm  is  the 
delay  on  link  ( j , n )  encountered  by  traffic  to  m.  For  deterministic  input 
demand,  the  objective  is  to  clear  the  congestion  (so  that  qnm(T)=0  for  all 
n  and  m)  at  a  specified  time  T  which  is  large  enough  to  achieve  this 
objective  while  minimizing  the  aggregate  cost  of  congestion 
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n=l  n  k=l 


unkm(t)]dt 


(2) 


In  this  integral  the  qnm  terms  give  the  aggregate  delay  due  directly 
to  the  congestion  while  the  flow  terms  produce  a  cost  or  aggregate  delay 
associated  with  the  link  flows.  The  constraints  to  be  met 
_  _  M 

%  i  °:  unk  i  0  and  *  unk  1  Cnk  (3) 

m=i 

for  all  m,  n,  k  and  t 


impose  the  requirements  that  queue  lengths  and  flows  must  be  non-negative 

and  that  each  link  has  a  finite  capacity  C  .  . 

(21  nK 

D'Ans  and  Gazis1  have  shown,  that  an  optimal  open  loop  control  for 
this  centralized  control  problem  can  be  computed  by  discretizing  the  time 
appropriately.  When  this  is  done,  the  problem  can  be  expressed  as  a 
linear  programming  problem  and  an  optimal  open  loop  solution  can  be 
computed  for  any  initial  conditions. 

This  problem  with  all  s  ,  m  equal  to  zero  was  considered  by  Moss  and 
(31  nic 

Segall1  They  present  a  set  of  necessary  conditions  via  the  maximum 
principle  but  they  offer  a  general  procedure  for  finding  the  flows  in 
feedback  form,  only  for  single  destination  networks  with  zero  input  demand. 

In  seeking  a  closed  loop  solution  to  this  centralized  optimal  control 
problem,  one  runs  into  considerable  difficulty  because  even  necessary 
conditions  are  nor.  available  for  problems  having  both  state  variable  con¬ 
stants  and  time-delay  systems. 
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Most  of  the  difficuties  inherent  in  the  centralized  formulation  can  be 
avoided  by  adoptinq  a  decentralized  approach  to  this  problem.  In  general 
the  decentralized  approach  to  controlling  large  scale  systems  is  to  use  .1 
number  of  low  level  controllers  operating  on  local  information  to  control 
portions  of  a  large  system.  This  local  information  is  supplemented  by  some 
interchange  of  information  between  local  controllers  or  between  local 
controllers  and  a  supervisory  controller.  How  well  the  overall  system 
operates,  depends  on  what  information  is  exchanged.  The  local  controllers 
are  designed  to  perform  their  limited  tasks  optimally.  For  the  dynamic 
routing  problem  considered  here,  the  local  controllers  are  node  level 
controllers  which  must  assign  traffic  flow  on  the  links  leaving  the  node 
based  on  the  information  available  locally. 


The  Local  Optimization  Problem 


The  optimization  problem  faced  at  a  typical  node  is  based  on  the 

queue  dynamics 

qm(t)  =  Rm(t)  -  I  u  m(t)  (4) 

4=1 

where  Rm  is  the  total  rate  of  traffic  arrivals  (from  both  upstream  nodes 
and  from  outside  the  network)  which  have  the  destination  m  and  there  are 
L  links  which  exit  from  the  node.  We  wish  to  find  the  link  flows  u£m(t) 
as  a  function  of  the  local  state  vector  q(t),  to  minimize 
T  M  L  m  m 

J{u}  =  /  I  [ qm( t )  ♦  I  S.mu.m(t)]dt  (5) 

t0  m=l  4=1  *  * 

where  S£m  denotes  the  cost  or  delay  per  unit  of  flow  on  link  4  to  reach 
destination  m.  The  constraints  are  as  in  (3)  and  q  (T)  =  0.  It  is  ap¬ 
parent  that  since  all  incoming  demand  has  been  consolidated  into  one  term, 
the  local  controller  is  not  attempting  to  control  a  time-delay  system.  A 

feedback  solution  can  be  thus  be  sought  using  Hamilton -Jacobi  theory. 

The  minimum  cost  due  to  congestion  can  be  expressed  as 


T  M 
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where  are  the  optimal  non-congested  flows  (i.e.  the  flows  which 

minimize  J  in  (5)  when  q  (t0)  =  0)  and  J*  is  the  corresponding  minimum 

L  * 

non-congested  cost  on  [t0,  Tj.  Note  that  Rm(t)  =  Z  u.m  1 1 ) .  When  the 
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the  minimum  cost  function  V'(  >2 )  satisfies 
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R‘“  are  constants,  then 

Hamilton- Jacobi  equation 

M  L 

m(t)  ♦  2 
2=1 

where  V(0)  =  0  and  the  minimization  is  subject  to  constraints  as  in  (3).  11 

the  V(  q)  which  satisfies  (7)  and  boundary  conditions  imposed  by  state 
constraints  can  be  found  then  this  would  produce  necessary  and  sufficient 
conditions  for  the  optimal  flows. 

A  complete  solution  to  the  local  optimum  control  problem  is  not  yet 
available.  A  method  for  obtaining  an  approximate  solution  was  presented 


by  Ozgtiner  and  Sarachik 
for  two  special  cases. 


(4) 


However  the  exact  solution  is  known  only 


Two  Special  Cases 

The  Single  Destination  Case  (M=l).  Chu^  first  solved  this  local 
dynamic  routing  problem  using  a  geometric  approach.  Later  Sarachik  and 
Ozgtiner^  obtained  the  same  solution  using  the  approach  above.  Since 
(7)  must  be  minimized  at  each  time  instant,  this  is  equivalent  to  choosing 
the  u£(t)  to  minimize 
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subject  to  the  constraint  0  <  u£(t)  <  C£.  This  says  that  we  should  choose 

|C, 


u£(t)  = 


for  s*  <  I? 


for  s*  i  I? 


(9) 


Thus  if  the  links  are  numbered  according  to  the  ordering  S1  <  S2  £ 
. . .  <  SL  and  if  K  is  the  largest  index  l  satisfying  S£  <  (3V/3q)  we  see 

that  on  links  £=1,2 .  K  the  full  capacity  should  be  used  to  carry  traffic 

flow  whereas  on  links  £=K+1,...,  L  no  traffic  should  flow.  Substituting 
this  result  in  (8)  gives 


iC.  for  q ( t )  >  Y 
U2(t)°pt  !o  for  q(t)  <  Y£ 


(10) 


where  the  thresholds  Y£  are  given  by 

Y£  =  ^  (Ci-V(S£-Si> 

for  £=!,...[". 
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Multi-gestination  with  L =2.  Sarucmk1'1  obtained  a  solution  tc  i  7  •. 
with  arbitrary  \1,  tor  a  node  with  only  two  exit  links.  The  method  ol 
solution  involved  finding  the  solutions  along  the  state  space  axes  (these 
are  one  dimensional  problems)  and  using  the  results  as  boundary  conditions 
lor  the  admissible  state  space  q  >  0  The  derivation  can  be  found  in  [  7  J 
The  results  show  that  the  admissible  state  space  is  divided  into  non¬ 
intersecting  regions  by  planes.  In  each  region  the  optimal  Hows  must 
satisfy  a  necessary  and  sufficient  condition.  The  regions  can  be 
found  by  simple  threshold  tests  on  aggregate  queue  lengths  Specifically 
with  -  S?m-Slm  (the  additional  cost  of  using  link  2)  the  destinations 
are  ordered  according  to  ^  >  0  2  >. . .  >  PK  >  0  >  p  R+1  > . . .  >  pM  (note  that 
traffic  to  destinations  1  and  K  prefer  to  use  link  1).  Thresholds  are 
calculated  using 


X:(k)  =  Xx(k+1)  ♦  Yl(k)  [3k'Pk+1]  for  k  =  K .  1  (12) 

with  X^K+1)  ^  PK+1  Yl(K)  and 

X2(k)  =  X2(k-1)  ♦  y2(k-l)  (pk_1-pk)  for  k  =  K+l .  M  (13) 

with  X2(K)  -  -gKY2(K).  The  Yj(k)  denotes  the  capacity  available  on  link  i 
for  emptying  queues  when  the  demand  Rm  for  destinations  m=l  to  k  is 
routed  on  link  1  and  the  rest  on  link  2  (note  that  for  k  t  X  this  means 
some  traffic  will  by  diverted  away  from  its  preferred  link).  The  composite 
queue  lengths  are  next  defined  as 
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(14) 


and  the  test  variable  T^k)  =  (Y2(k)Q1(k)-Yl(k)Q2(k))/Y2(k)  is  found  for  all 
those  k  <  K  which  also  make  Y2(k)  >  0).  Fork  <Kif  X^k+1)  <  Tj(k)  <X^(k)  and 
Q^(k),  Q2(k)  >  0  the  optimum  flows  satisfy 

a)  U.(k)  =  C. ;  u.m  =  0  for  m  >  k 

(15) 

b)  U2(k)  =  C2;  u2  =  0  for  m  <  k 
where  the  composite  flows  are  defined  as 


a  * 
U,(k)  *  i 
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,m  ;  U,(k)  $  l  u.m 

L  ^  m=k+l  z 


(  16) 


whereas  if  T^(k-l)  <  X ^ ( k )  <  T^(k)  the  optimum  flows  must  satisfy  (15a) 
and 

U-,(k)+u^  =  C2  ;  u2m  =  0  for  m  <  k-1.  (17) 
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Vhe  only  difference  between  (15b)  and  (17)  is  that  (17)  permits  a  norw.ero 
For  a  state  on  the  boundary  of  the  admissible  region  (i.e.  when 
Q|(k)=0  for  some  k  or  i)  the  conditions  (15;  and  (17)  must  be  modified 
slightly  In  this  case  the  lb(k)  must  equal  the  corresponding  aggregate 
demand . 

If  the  above  tests  fail  then  for  k  >  K  the  test  variable  T2(k)  = 
(Y^Ck)  Qo(k)  -  Y2(k)Q1(k))/y1(k)  is  formed  for  all  those  k  for  which  y^(k)  >  0. 
For  Q^(k),  Q-,(k)  >  0  if  X2(k)  <  T2(k)  <  X2(k+1)  the  optimum  flows  must 
satisfy  (15)  whereas  if  T2(k)  <  X2(k)  <  T2(k-1)  the  optimum  flows  satisfy 
(15a)  and  (17).  It  should  be  noted  that  (15)  or  (17)  do  not  specify  the 
flows  uniquely  so  many  optimal  implementations  are  possible. 

Coordination  of  Local  Control 


In  order  for  the  node  level  controllers  to  implement  the  local  control 
strategies,  information  must  be  supplied  to  it  about  the  parameters  S™ 
This  can  be  done  for  each  link  exiting  the  node  by  forming  S.m  =  s^  + 
where  s^  is  the  delay  (or  cost)  of  traversing  link  2  to  the  first 
downstream  node  j  and  denotes  an  estimate  of  the  delay  (or  cost)  of 
reaching  destination  m  from  node  j.  These  cost-to-go  terms  must  be 
computed  at  each  node  and  transmitted  to  the  upstream  nearest  neighbor. 
The  way  these  terms  are  calculated  greatly  affects  the  performance  of  the 
overall  network.  Chu^  using  a  capacity  weighted  average  for  a  single 
destination  network  (M=l)  defined 
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(18) 


and  had  to  restrict  the  application  to  non-cyclic  networks.  Whereas  if  Cg 
is  replaced  by  the  excess  capacity  -  C^-u^  the  strategy  can  be  extended  to 
networks  containing  cyclic  paths  since  it  can  be  proven  that  no  cyclic 
flows  will  occur^J,  Sarachik^  using  flow  weighted  averages  (i.e. 
replace  in  (18)  by  u£m)  for  the  multi-destination  case  required  the 
non-cyclic  restriction.  This  restriction  was  removed  in  [4]  by  using  a 
certain  kind  of  excess  capacity  average.  The  question  of  what  is  the  best 
coordination  scheme  is  still  open. 
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CONCLUSION 


This  paper  has  described  the  general  problem  of  using  dynamic  routing 
to  clear  congestion  from  a  congested  network.  A  decentralized  strategy 
was  described  which  utilizes  local  node  level  controllers.  The  optimal  local 
strategies  were  presented  for  2  special  cases  when  traffic  demand  is  con¬ 
stant.  Research  is  being  done  to  extend  the  local  results  for  traffic 
demand  modeled  by  a  Poisson  arrival  process  and  to  obtain  the  exact 
optimum  routing  for  M  destinations  with  L  exit  links.  More  research  is 
also  needed  to  determine  whether  an  optimal  coordination  strategy  can  be 
found. 
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‘An  Effective  IxkaI  Dynamic  Strategy  to  Clear 
Congested  Multidestination  Networks 

PHILIPL  SARACHIK 

Abstract  —The  totulton  ot  a  local  fouling  problem  for  a  congested  node 
with  two  ajlemalor  paths  to  W  destmaiions  ol  a  traffic  network  ts 
presented.  The  solution  shows  that  the  admissible  stale  space  is  diude-d 
into  a  finite  number  of  regions  within  which  a  different  set  of  optimal  flow 
conditions  must  be  satisfied.  A  specific  realization  of  these  optimal  condi 
tions  is  presented  which  can  be  utilized  by  local  node  controllers  in 
decentralized  control  strategies  (or  larger  networks 

1.  IsmtoDurrioN 

The  problem  of  routing  traffic  ihrough  networks  from  nodes  of  origin 
to  destination  nodes  is  an  important  problem  common  to  the  field  of 
traffic  control  and  data-commumcation  networks  Much  of  the  early  work 
dealing  with  the  traffic  assignment  problem  in  both  fields  was  restncied 
to  finding  an  optimum  steady-state  routing  (static  strategics!  under  non- 
congested  conditions  In  1974  Gazis  1 1 1  proposed  that  a  store-and-forward 
network  could  be  used  to  model  congested  transportation  svsiems  In  this 
model  the  delay  due  to  congestion  is  associated  with  the  watting  time  on 
queues  For  a  single  destination  node  and  uvng  the  aggregate  delay  in  the 
network  as  the  performance  criterion.  D  Am  and  Gazis  [2|  obtain  an 
open-loop  solution  by  discretizing  in  time  and  Chu  and  Gazis  |3|  obtain  a 
dynamic  feedback  solution  for  a  simple  network  with  two  alternate  routes 
At  about  the  same  time.  Segall  and  Moss  |4|-(b|  used  a  similar  model  to 
study  the  dynamic  routing  problem  in  congested  muliidesnnalion  data 
communication  networks.  They  assumed  that  the  delays  encountered  in 
transiting  network  links  are  zero,  so  the  resulting  queue  dynamics  are 
governed  by  ordinary  differential  equations  Using  the  maximum  pnnci- 
g'..  they  obtain  a  set  of  necessary  conditions  for  the  opnmal  solunon  and 
propose  a  constructive  algorithm  for  finding  the  opnmal  centralized 
control. 

When  transil  delays  are  present,  the  queue  dynamics  are  governed  by 
differential  difference  equanons  and  ihe  general  centralized  opnmal  dy¬ 
namic  routing  problem  becomes  formidable  since  stale  variable  con¬ 
straints  are  also  prcsenl  Most  of  the  difficulties  inherent  in  the  central¬ 
ized  problem  can  be  avoided  by  seeking  a  decentralized  control  solution. 

The  decentralized  approach  to  clearing  congestion  from  traffic  net¬ 
works  has  been  discussed  in  three  recent  papers  (7|-[9|.  In  (hese  papers 
the  problem  of  finding  real-time  dynamic  feedback  rouitng  for  traffic  is 
formulated  as  an  optimal  control  problem  and  a  decentralized  structure 
for  the  solution  is  utilized  Chu  [7]  considered  a  single  dcsiinanon  acyclic 
network  and  presented  a  solution  based  on  (he  interconnection  of  local 
controllers  which  implement  route  selections  at  each  node  b\  choosing 
among  A'  alternate  roules  lo  the  single  destination  Sarachik  and  Ozguner 
(9)  considered  single  destination  cyclic  networks  They  show  that  the 
optimal  local  routing  algorithm,  when  used  in  their  decentralized  strategy, 
leads  to  loop-free  flows  in  Ihe  network  even  ihougn  cyclic  pjihs  are 
present  Sarachik  |K|  considered  muliidesnnalion  acyclic  networks  and 
presented  a  decentralized  strategy  for  (hat  case  Local  routing  strategies 
for  two  specific  simple  subnetworks  were  utilized  in  ihe  decentralized 
strategy  to  illustrate  the  control  of  a  larger  network 

In  order  to  make  the  decentralized  routing  sirategv  of  | H|  for  muln- 
dcsttnation  networks  more  generally  useful,  iho  cot  respondent  considers 
the  preb'.-rn  of  designing  a  local  feedback  conimilet  lot  a  (spual  n-vfc  of 
a  mulfideslinalion  network  when  the  node  has  two  ceil  link-,  isoc  big  l| 
An  optimization  problem  is  formulated  and  solved  for  (his  hscal  problem 
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when  ihe  demand  for  service  is  constant  If  is  shown  that  ihe  slate  space  is 
divided  into  regions  separated  by  linear  manifolds  The  closed  form 
conditions  for  the  optimal  comrol  are  given  for  each  region  These 
conditions  do  not  specify  a  unique  ophmal  control  so  a  specific  optimal 
realization  is  presented.  This  realization  is  in  the  form  of  a  feedback 
control  which  can  be  easily  implemenied  in  real  lime 

In  using  any  decentralized  control  strategy  to  route  traffic  through  a 
complex  network,  decisions  must  be  made  locally  at  each  node  on  how  to 
best  allocate  traffic  to  branches  leaving  ihe  node  These  decisions  will 
affect  the  traffic  levels  at  downstream  nodes  and  thus  must  be  based  on 
information,  provided  to  the  local  node  controllers,  about  conditions  al 
other  nodes  of  the  network  Questions  concerning  how  the  communica¬ 
tion  between  nodes  affects  the  overall  behavior  of  large  networks  which 
use  decentralized  routing  are  extremely  important  bul  are  beyond  the 
scope  of  this  paper.  References  [7)-{9j  address  some  of  these  questions 
and  indicate  how  local  node  controllers  can  be  utilized  in  a  decentralized 
dynamic  routing  strategy 

II.  Thi  Locai  Routino  PZO»L£S4 

The  demand  for  service  is  the  net  traffic  (low  arriving  at  a  typical  node 
along  inbound  links  and  from  outside  ihe  network  This  demand  is  mci  by 
routing  ihe  incoming  traffic  to  the  exit  links  When  traffic  demand 
exceeds  Ihe  capacity  of  the  exit  links,  queues  build  in  front  of  (he  node 
When  the  capacity  exceeds  the  demand,  the  queue  lengths  will  be  de¬ 
creased  In  this  paper  attenuon  is  restricted  to  the  situation  which  exists 
toward  the  end  of  "rush  hours  "  That  is.  congestion  exists  in  the  form  of 
long  queues  bul  the  demand  is  less  than  Ihe  capacity  of  the  exit  links  so 
ihe  queues  can  be  emptied  and  the  congestion  cleared  out. 

At  a  congested  node  containing  queues  of  traffic  for  U  destinations  and 
having  only  two  exit  links,  the  queue  dynamics  c-n  be  represented  by  (I). 

|5| 

<j~(  t )  =  z"  -  aj"(r)  -  «”(/)  form  =  1.2.  --.M  (I) 

or  in  vector  form 

=  f “i(') 

where  qm  denotes  a  continuous  state  variable  which  approximates  the 
length  of  the  traffic  queue  (measured  in  vehicles,  bits,  messages,  etc) 
having  node  m  as  its  destination;  um  is  the  flow  rale  on  link  /  destined  for 
m  and  rm>  is  a  constant  flow  of  traffic  entering  the  node  destined  for  m  ( r 
i\  the  demand  vector) 

The  problem  faced  »•  he  node  com  roll  et  is  to  choose  the  control 
vectors  u,  and  w2  at  each  nmc  instant  in  order  to  emptv  the  queues  while 
optimizing  some  performance  measure  A  useful  measure  of  this  perfor 
mance  (7j-(10)  the  aggregate  cost  (or  delas)  given  by 

v 

[«“■(  t  )  *  v  r  “7(1  )  •*-  -V;~  u~(  /  >1  Jl  (2) 

'ft  a-  I 
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where  S~  dcmiles  ihe  c'**sl  I  <  v  Jelav)  (V  f  unit  *  if  Hi**  on  link  i  Jo  reach 
decimation  m  The  lime  T  i'  ans  lime  greater  ihan  ihe  iiuk'  I ■  required  in 
empiv  all  queues  al  the  mxlc  Aiih  minimal  um  Note  iltji  ilie  smtinhti 
lion  10  J  due  to  l he  y"  lernix  is  ilie  atarcgjic  dclai  jweuicil  with 
clearing  the  queue*  |l|.  |5|.  while  ihe  oilier  terms  represent  the  aggregate 
coal  (or  d  'lav)  associated  with  reaching  the  destinations  from  ihe  eniranec 
of  the  exi,  links  ( IU| 

l(  ihould  be  noted  that  when  ihe  local  cooirotler  is  uwj  in  a  devcnii.il 
uted  selling.  ihe  coal  parameters  S,m  will  he  competed  of  iwn  portion* 
S~  - 1~  *  p”  where  /,"  la  the  coal  (or  detail  at  traversing  link  i  to  ihe 
rim  downstream  node  /  and  ta  a  coal  (or  detai  l  aaaociaicd  with 
reaching  declination  m  from  node  j  The  node  coniroller  will  receive  the 
information  p“  from  ilv  down>iream  neighbor*  and  update  the  value*  S* 
In  operanon  ihe  node  controller  must  he  able  to  adapt  lo  there  update* 
Thu*,  a  real-time  feedback  volution  of  the  local  optimisation  problem  i* 
cnemial 

In  thia  problem  ihe  queue*  can  be  emptied.  *incc  we  coruider  only  the 
caac  for  which  the  total  traffic  demand  A  -  I“.  ,r“  entering  the  node  it 
leaa  than  the  combined  capacity  of  the  two  exit  linkt  (C, +  f,|.  The 
optimal  rsoncoogriied  com  J‘  can  be  defined  aa  the  minimum  of  (2)  when 
the  initial  ttaic  y <(,)  it  lero  Thu*,  for  the  tame  T aa  in  (2). 

ar 

j'-fT  2  (sr*r +*?•?)*  (j) 

<•  «>i 

where  u“‘  are  the  optimal  noncongeated  flow*  and  yf  ♦  vj  =  r.  The 
minimum  ooat  due  to  the  congcation  at  the  node  it  obtained  by  tubtract- 
iog  /•  from  the  minmum  of  (2).  Thia  gives 

*'(«('*))  -  min[y{g)-y] 

(id 

w 

(*) 

where  die  mininuxaiioo  it  carried  out  subject  to  the  conatrainu 

v 

?(r)>0;  y.(r)*0:  J  «,"(')«?.  for, -1.2:  ,0*,*T 

m  »  I 

(J) 


and 


Thcae  contirainia  impose  ihe  require  menu  that  queue  lengths  and  (Iowa 
mutt  be  nonnegative,  (hat  each  link  ha*  a  finite  capacity,  and  that  (he 
queue*  null  be  empiv  al  ihe  terminal  lime.  Noic  (hat  for  a  given  yd,,) 
and  any  T  » r*  ( T  could  be  *  |  Ihe  name  minimum  co*l  I  it  obtained,  to 
V  doc*  not  depend  on  the  limc-io-go  \  T  -  /,).  Abo.  »y*iem  <(|  i>  lime 
invariant  and  the  coefficients  in  (2)  arc  contlam.  Therefore,  r  i*  indepen¬ 
dent  of  the  jlarttng  lime  r.  Applying  Hamition-Jacobi  theory,  we  know 
that  the  minimum  cost  ^1  if  I  for  *y*iem  (I)  satisfies  ihe  partial  differential 
equation 


mm 
•t  n 


A# 

l  [»"(»)♦  jr(*r(f)-*r*)**r(*r(»)->r’)] 

•9  -  I 

♦^^(r-u,(r)-«.(,)|}  =0  (6) 


where  ihe  minimuaiion  is  \uhjcci  io  coniiraini*  (5)  Using  r  =  «•  «■  uf 
ihi*  equation  can  be  rewritten  a* 


(ihe  dependence  of  y~  and  v,"  on  ,  i*  not  indicated  explicitly)  Noie  ihai 
vmce  (7)  ii  linear  in  ihe  »,'*.  the  minimum  lie*  on  ihe  constraint 
houndane*.  and  uncc  two  of  these  boundaries  involve  sums,  an  optimal 
solution  need  not  he  unique 

By  finding  ihe  kTjl  which  satisfies  (7)  and  the  boundary  condition* 
imposed  by  the  tiale  cotulrainU.  we  can  obtain  explicit  neceiaarv  and 
sufficient  condition*  for  ihe  optimal  flow* 


til.  Optimal  Flow  Conditions 

In  this  section  the  conditions  (or  optimal  flow  art  presented  A  detailed 
description  of  bow  these  conditions  are  derived  is  given  in  |ll|  To 
simplify  the  presentation,  the  destinations  are  numbered  so  that  for 

0m*S?-Sr 


With  this  numbering,  traffic  to  destinations  I  through  K  prefers  link  I 
while  link  2  is  preferred  by  traffic  to  destinations  K  +  I  through  A/  Also, 
if  it  becomes  necessary  to  divert  traffic  from  link  I  to  link  2.  traffic  to  A 
ihould  not  be  diverted  until  all  traffic  to  m  >  k  has  been  diverted 
Conversely,  when  traffic  must  be  diverted  from  link  2  to  link  I.  traffic  to 
k  should  not  be  diverted  until  all  traffic  to  m  <  k  has  been  diverted 
It  is  convenient  to  define  the  excess  capacities  as 

T,( *)*<:,-  2  -r'i  2  «r  (») 

*•1  W-l 


*1*1 


Ta(*)4i 


T,(*)  *  1 


O.Y,«)- 


2  -r 

♦  i 


o.  y,(K)~  2  if* 

•  -  ff  ♦  I 


tor*-0.  -,/C-l  (9) 

:  Ti(*)*T-Ti(*) 

for  A  —  A‘  +  I .  •  .  A/  (10) 


where  r  4 Ti(  K I  +  Tj<  K  I.  Note  that  (*)  defines  the  capacity  available  on 
each  link  for  clearing  queues  after  ihe  steady-siaie  demand  is  accommod¬ 
ated.  when  no  sicidy-iiaic  flow  is  diverted.  When  f ,  and  C:  are  each 
targe  enough  to  accommodate  the  preferred  demands,  then  u,“'  =  rm. 

-  0  for  »r  <  K  and  -  0.  for  m  >  K.  and  both  y,(  K )  and 

y.(  K  )  are  greater  than  aero  If  onr  nil  link  has  insufficient  capacity  C,  to 
accommodate  it*  prtfrrrtJ  traffic,  then  as  much  preferred  traffic  a* 
possible  is  routed  on  this  link  (with  prion  lie*  governed  by  the  )Sm)  giving 
y,(  A'  )-0  and  the  rest  i*  diverted  io  the  other  link.  It  it  ihu*  quite  simple 
io  determine  (he  value*  v”’  for  ail  m  and  ,  for  ihi*  problem  and  to  form 
ihe  veil  y,  —  |  y,(0).  . y,(  Af  I)  lor  ,-1.2  u*tng  (HI.  (9).  and  (10)  The 

y,(A|  and  y,(  A I  value*  from  (9)  are  ihe  cxcest  capacities  available  (or 
emptying  queue*  when  demand  for  destinations  (A  »  I)  lo  A  is  diverted 
from  link  I  io  2.  whereas  the  value*  from  (Id)  are  ihe  excel*  capacities 
when  the  demand  for  dolma  (ion*  A"  ♦  I  to  A  ta  diverted  from  link  2  (n  I 
Divtruon  of  traffic  front  one  kak  to  another  can  eventually  use  up  ihe 
capacity  of  the  link  receiving  the  diverted  traffic  To  keep  track  of  this,  wt 
define 

A  -  ihe  smallest  value  of  A  >uch  that  y.-(  k  )  >  0 
A  =  the  largest  value  of  1  such  that  y,(  A  )  :»  0 
A  =  mm  (  A  .  A  ] .  A  -  mas  |  A  .  A  ) 


(II) 
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A  sci  of  threshold  constants  can  now  be  defined  recursively  hs 

*.<*)  =  *.(*-')♦  Y, (A  )|0,- /I*.,  |  forA  =  A.  .1(12) 

with  X,(  K '  *  1 1  -0A  .  i y,(  A  )  (so  Y,(  A  )  >  0  is  nondeereasini  as  A  decreases 
from  A  to  I )  and 

*»(*)  =  *j<*- I)*  Tj(* -!>(**-, -&1  for  A  =  Af  +  I .  M 

<«») 

with  Aj(  A )  -  -  fff  7j(  A )  (so  A;(  A ) »  0  is  nondeereasini  as  A  increases 
from  A  +  I  to  Ml  Composite  queues  are  defined  as 

i  H 

<?.(*)4  2  0m.  <?a(*)  -  2  ?“  =  <?-<?,(*)  0<) 

m  «  I  m  *  k  ♦  I 

where  0  is  the  sum  of  al)  queue  lengths.  The  composite  flows  on  each  link 
arc 

i  v 

i/,(*)4  2  «(*)-  2  (IS) 

*a|  «•!♦! 

Note  that  A  =  0  means  @,(0)  =  0  and  f/,(0)  =  0  while  A  =  V  means 
Q1(M)  =  0  and  f/2<  V )  =  0:  for  notations!  convenience  the  explicit 
dependence  of  these  quantities  on  r  has  been  omitted  The  upper  flow 
bounds  are 


When  the  state  sector  q  lies  in  a  region  A,,  optimal  flows  must  saiisfs 
the  condition 


(  r'“‘  T< 


(*)  (?;(*) 

(<)  Y2<*) 


In  (19)  (recall  that  A  is  the  number  of  destinations  which  prefer  link  I).  if 
A  =  0  the  upper  bound  is  xero  but  if  A  =  M  the  lower  bound  is  zero 

r,(A)>Jtr,(A)  for  A  =  A  <  A' 

r,(A-l)<  A,(A)<r,(A)  for  A  <  A*  A 

r,( A  -  I)  <0  foe  A  =  A* -i- 1 «  A 


0»*  - 1 =  {9 


T,(A  -  I)  <0  for  A  =  A  +  I  <  Af 

r,(  A )  «  0  for  A  =  4  »  A  +  I 

r5<  A  )  «  JT,(  A  )  <  ra<  A  -  I )  for  A  <  A  <  A 

T,(  A  -  I )  >  Aj(  A  )  for  A  =  A*  +  I  >  A'  +  I 

(20) 


The  region  in  which  q(i)  lies  and  ihe  index  value  A  is  obtained  by 
determining  which  of  the  inequalmes  1 1*).  (19).  (20)  ihe  slate  vector  q(i) 
satisfies 


(/,(«)  =t/,(A): 

and 

</»<0)«t/,(  AJ- 
and  the  restriction 


=  C,(  A)  if  <?.(  *  )  =  0 

C,  otherwise 


.)  if<?,(A)  =  0 

otherwise 


a," +  when  qm  =  0  (21c) 

Note  that  £/,(  Af )  =  C/,(  A )  means  that  u|"  =  0  for  m  >  A  and  l/2(0)  =  £A(  A  ) 

means  u“  =  0  for  m  <  A 

If  the  state  vector  f  lies  in  a  region  A*  *-,,  then  optimal  flows  must 
satisfy  (21a)  and 


£2,(0)=^(A-I): 


:-\)  if<?,(A-l)=( 

otherwise 


C,(A)*min|c,.  2  Cj( A )  -  mtn  |c2 .  2  <»*> 

The  solution  of  the  Hamilton- Jacobi  equation  (7)  shows  (II)  that  the 
admissible  state  space  q  »  0  is  divided  into  disjoint  regions  which  fill  the 
space.  In  each  region  V(q)  is  given  by  a  different  quadratic  function,  but 
it  is  continuous  across 'the  region  boundaries.  These  regions  can  be 
expressed  conveniently  in  terms  of  variables 

T,(  A)  *<?,(*  )-^j<?2(A)  defined  (or  A  <  A 
7i(A)*(?2(A)-I^(?1(A)  defined  (or  A  <  A'  (17) 

The  regions  are 
*** 


A,(A  +  I)  <  T,(  A )  <  A,(  A )  for  A  <  A  «  min ( A  -  I .  A* ) 

Afj(  A )  <  T2(A )  <  JT2(A  +  I )  for  ma*(  K  +  I .  A  )  <  A  <  k 


as  well  as  restriction  (21.*)  Note  that  actually  the  only  difference  between 
these  conditions  is  that  (2  Id)  allow:  u2  to  be  nonzero  (this  is  rrquirtd  onlv 
when  Qj(  A  -  I )  *  0)  so  traffic  to  destination  A  could  be  routed  vu  both 
links  I  and  2.  Since  these  conditions  do  not  specify  the  optimal  flows 
uniquely  any  implementation  which  is  convenient  will  be  optimal  These 
conditions  are  necessary  and  sufficient  for  optimality  because  they  were 
obtained  from  V(q )  which  satisfies  the  Hanulton-Jacobi  equation  (7)  and 
boundary  conditions  imposed  by  the  state  constraint  q  >0  (Reference 
1 1 1)  gives  expressions  for  the  minimum  cost  V(q)  in  each  region  and  a 
proof  that  these  regions  fill  the  admissible  state  space  q>  0  ) 

The  region  in  which  q  lies  could  be  determined  by  using  a  micro¬ 
processor  or  a  special  chip  which  implements  the  conditions  ( IK)-(20)  in 
panM  by  comparing  the  vector  of  lest  variables  against  the  vector  of 
tbnmheld  values 

IV.  A  Specific  Optimal  Routing 

In  the  preceding  section  the  general  requirements  (21)  permit  many 
optimal  routing  strategics.  In  this  section  a  specific  optimal  routing  is 
presented  which  satisfies  these  general  requirements 


For 

sutes  lying  in  the  regions  Akk  consider  (he 

*1 

*k 

•) 

(•*) 

b) 

.r.o:  ■r-.r*.r--t,(.)^ 

(19) 

for  A  +  I  <  m  <  M 
(22) 


where  for  Q,(  A )  =  0  or  Qt(  A  )  *  0  the  indeterminate  term  0/0  is  defined 
to  be  zero  (thus  (21c)  is  satisfied  when  q“  =  0)  We  now  verify  that  (22) 
satisfies  the  requirements  (2la)-(2lb).  For  p,(A)>0  and  @;(A)>0. 
summing  (22)  gives 

1  1 

*)  u.m*  2  ■."*+  2  +  y,<*) 

m  *  I  at  •  I 

U  V 

b)  l /,(*)=  2  2  +  72(  A )  (23) 

•  »  *  *  I  m  »  k  ♦  I 

Now  y,(A  )  >  0  and  y,(  A  )  >  0  (this  is  necessary  for  A,,  10  exist  1  so  for 
A  <  Af  we  have  =  0  for  1*1  <  A  and  1/"'  =  0  for  m  »  K  *  I  Thus. 


»>  ^.(*)=  2  -r'^Y,(*)  =  T ,<*>♦  2  -r'  =  c, 


itfcf  r  ran  sections  on  automatic  control,  vol.  ac  27.  no  2.  april  1912 


x  H 

b>  i /,<*>=  2  +  2  “r‘  +  T,(*>  =  y,<*> 

m  —  k  ♦  I  m  =  4  ♦  I 

M 

+  2  -r‘=Q  w 

«t  -  H  I 

For  A  >  K  this  is  demonsiraicd  in  a  similar  manner  When  0,<A)-  0 
the  y,(  A  )  term  does  not  appear  so  (23a)  pves 

k  k 

2  (-r‘  +  -?*)=  2  r-=c,(4) 

m  •  1  MI  ■  I 

since  y,<  *  )  >  0  Similarly  when  Q2(  k )  =  0  the  y2(  * )  does  nos  appear  in 
(23b) 

M 

u}(k)=  2  '--«»(*>■ 

For  slates  lying  in  the  regions  <t  l_  ,  the  situation  is  complicated  hy 
degeneracies  which  can  occur. 

A  composite  expression  which  gives  optimal  (lows  for  ?  in  Aa  t . ,  is 


a)  uf  =  uj-’  +  r.(*)- 


u7  =  0  for  I  <  »i  <  A  -  I 


b)  «f  =  uf‘  +  rl(*)^-r  +  A(*) 


c)  -*-"+r*(*)wh irA(t) 


d)  vf=o;  ur  =  ur’  +  «r‘+r,(*) 


for*-H*m«M  (25) 


where  the  indeterminate  term  0/0  is  defined  to  be  mo  when  either  <?,(  A ) 
or  Qt(k  -  I)  is  zero  so  again  (21c)  is  satisfied  when  g"  =  0.  The  symbols 
used  in  (25)  are  denned  as 


r,(*)ftT,(*-l):  r,(*)*Ti(*-l)  1  | 

A(*)~ -Tj(*);  focAWAj 

r.(*)=y,(*).  r,(*)*T,(*)  1 

(*  +  l)*T.<*):  A(*)4«r  for  A  v*  A  +  I  J 


tor  k  *  K 


for  A  >  K 


Wc  now  venfy  that  (25)  gives  U,(k)-C ,  and  l/2tA-l)  =  C2  which 
satisfies  the  conditions  (21a).  (2ld)  for  an  optimum  When  *  =  *  >  A.'  or 
*  =  k  +  I  «!  K  or  k  is  in  the  range  A  <  k  <  *  +  I,  the  T,(*  I  are  positive  so 
the  result  can  be  shown  exactly  as  before  When  k  =  A  *  K  and  p,(  * )  >  0 
we  find  by  summing  (25)  that 

t  *-• 

4)  t2,(*)=  2  u,"’+  2  •'"*  +  Ti(A-I)-Ti(*) 

m  ■  I  m  *  I 

H  H 

b)  2  *'r'+  2  «r'+T,(*-i)+Tj(*)  (26) 

m *  4 ♦  I  mM  k 

Since  y2(  *  > >  0  we  have  v~"  =  0  for  A  >  K  +  |  and  since  y2(  *  -  I )  =  0  wc 
have  T,(  k  -  I )  -  y  >  0  and  u =  0  (or  m  <  A  -  I  Thus  using  (R)  and  (9) 

k 

t/,(*)=  2  ■r’+T-nm-c, 


IA(*-I)=  2  2  "f  +  T2<*)  =  C2  (27) 

m  *  »  ♦  I  —  »  1 

Tins  resu1!  can  be  ootaincd  in  a  similar  manner  for  4  -  *  *  I  >  A 
To  show  ihat  (25)  is  opnmnm  when  Q:(k  —!)-()  *t  “ill  show  thai 
when  this  happens  the  state  q  cjn  only  be  in  A,  , . ,  with  »  «"■  A  and  we 


have  )usi  demonstrated  ihji  in  this  case  ihe  composite  flows  (26)  are 
optimum  Bs  definition  (I4|  for  ihe  composite  stales  note  that  if 
Q .1  A  1 1  =  0.  then  (l  I  m  I  II  and  yt(  m  )  .  i  1 1  (or  all  m  >  A  -  I 

For  A  -  A  (20|  require-,  ihal  y,(A  -  llt/iA  l)<  -  y,t  A  l)A'.(A| 
which  is  clcarls  impossible  since  y,(A  -  1 1.  y,(  A  -  ll  and  X ,(  A  |  cannot 
be  negative  For  A  -  A  «.  A  (20)  requires  the  impossible  condition  @,(A 
-  I )  w  A ,( A  )  «"  0,(  A  -  ll  \Ae  conclude  therefore  ihal  a  slate  q  with 
Q2(k  —  I )  =  0  can  saiisfs  ihe  conditions  < 20 )  for  At,  ,  ,  onls  when 

A  =  A  *  A'  A  similar  argument  shows  ihal  when  P,(A  |  =  0  me  stale  can 
be  in  4 ,  only  when  A  =  k  *  I  >  A' 

V  Conclusions 

The  problem  of  finding  an  optimal  rouiing  strategy  for  local  node 
controllers  in  a  mulndcslination  network  has  been  solved  for  ihe  case  of 
nodes  with  two  exit  links  and  constant  demand  It  has  been  shown  that 
ihe  admissible  slate  space  is  divided  into  control  regions  separated  by 
linear  manifolds  and  explicit  expressions  for  the  boundaries  of  these 
regions  arc  presented  To  each  region  ihcrt  exists  a  set  of  controls  which 
arc  optimal  for  the  stairs  of  that  region  A  specific  feedback  realization  of 
the  optimal  control  has  also  been  presented  The  implementations  (22) 
and  (25)  tor  the  optimal  flows  are  not  the  onlv  ones  possible  They  are. 
however,  convenient  for  traffic  control  purposes  because  they  provide 
some  exit  capacity  to  traffic  for  each  of  the  destinations  and  the  longer 
ihe  queue,  the  larger  will  be  its  share  of  the  capacity  They  can  also  be 
readily  implemented  using  a  microprocessor  since  they  consist  merely  of 
Ihe  sum  of  steady-state  terms  and  a  linear  feedback  on  the  ratio  of  a  slate 
variable  to  tlie  composite  state  Q,  of  which  g"  is  a  part  The 
steady-state  values  must  be  available  at  the  local  controller  either  by  local 
computation  or  they  must  be  supplied  by  a  central  supervisory  controller. 
Note  also  that  if  either  the  demand  or  the  exit  capacity  changes  to  new 
constant  values  then  the  y,  vectors  change,  so  that  the  boundaries  of  the 
control  regions  and  the  feedback  gains  change  This  merely  requires  a 
redeterminahon  of  the  region  in  which  q  lie*  The  feedback  rouiing  given 
is  therefore  adaptive  to  changes  not  only  in  demand,  but  in  capacity  as 
well 

It  should  be  noted  that  th-  solution  [or  r  constant  may  not  oe  as 
restrictive  as  it  appears  For  example  r  could  represent  the  average 
demand  over  some  interval  so  it  would  be  constant  on  that  interval 
Furthermore,  the  routing  (22)  gives  constant  (lows  «,"*(()  as  long  as  the 
state  remains  in  one  region  The  routing  (25)  gives  almost  constant  flows 
in  one  region  Thus  even  in  decentralized  operation,  when  all  nodes  use 
this  local  strategy  the  incoming  demand  can  be  considered  as  piecewise 
constant  over  intervals  The  feedback  routing  could  track  the  changes  hy 
updating  the  y,  vectors 

For  data  communication  networks,  the  problem  solved  here  could  be 
applicable  when  there  are  delays  or  costs  associated  with  sending  mes¬ 
sages  to  the  destinations  Of  course,  il  is  usually  not  desirable  lo  mix  the 
messages  for  the  various  destinations  on  a  single  link,  so  a  different 
specific  implementation  would  be  preferable  in  such  networks 
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Abstrsct 


A  dynamic  alternate  route  strategy  ia  developed 
for  a  typical  node  of  a  traffic  network.  the  strate¬ 
gy  compare!  the  queue  length  at  the  node  against  an 
optimal  threshold  value.  •lien  toe  threshold  is 
exceeded,  some  of  the  traffic  is  diverted  along  an 
alternate  more  costly  route  to  its  destination. 

The  local  node  level  strategy  is  easily  im¬ 
plemented  in  real  time  and  is  also  adaptive  to 
changes  in  route  capacities,  input  rates  or  the  rel¬ 
ative  delsys  along  the  alternate  route.  It  it  also 
suitable  for  use  in  a  decentralized  routing  strategy 
for  large  networks. 

Introduction 

The  store  and  forward  network  siodel  has  been 
found  to  be  very  useful  for  problems  concerned  with 
moving  traffic  through  congested  networks  from  origin 
to  destination  nodes.  With  this  model,  whenever  the 
instantaneous  traffic  demand  exceeds  the  capacity  of 
the  network  to  use  the  demand  for  service,  congestion 
builds  in  che  network  in  the  form  of  queues  which  are 
scored  at  the  network  nodes.  For  a  fixed  network 
structure,  the  problem  faced  at  each  node  is  how  to 
choose  the  routes  along  which  to  send  traffic  on 
toward  its  destination  while  minimizing  some  measure 
of  cost  or  delay  and  satisfying  the  capacity  con¬ 
straints  of  the  network  links.  For  stochastic  traf¬ 
fic  demaod  most  of  the  recent  work  has  emphasized 
dynamic  or  adaptive  routing  schemes  i  1  -o ]  which  use 
Information  aoout  the  ictual  state  of  the  node  or 
network  to  determine  the  best  routing.  In  almost  all 
this  work  s  node  with  L  routes  to  the  destination  is 
modelled  as  L  parallel  M/M/ I  oueues.  The  routing 
decision  is  made  at  the  time  a  customer  arrives  by 
placing  him  in  one  of  the  l  queues.  Foschini  ind 
Salz(2]  show  that  for  high  traffic  demand  a  join- 
the-shortest-queue  rule  gives  average  queueing  delays 
approaching  that  of  the  M/M/L  queue  (i.e  average 
delay  is  reduced  by  a  factor  of  L  compared  to  equal 
splitting).  foe  two  alternate  routes,  Boorstyn  and 
Livne  [6J  also  get  performance  approaching  the  N/M/2 
case  by  adding  a  third  queue  which  con  use  eif-er  of 
the  two  servers  wbea  the  dedicated  queues  become 
empty. 

It  would  seem  that  a  more  direct  approach  to 
achieving  such  reduced  queueing  delays  at  the  a  node 
la  to  operate  the  node  uaing  a  single  queue  with  L 
possible  servers.  The  routing  decision  can  then  be 
put  off  until  the  moment  a  customer  gets  to  the  nead 
of  che  queue  and  must  be  put  into  service  In  Fig  1 
the  box  0  denotes  a  decision  rule  wmen  allocates 
traffic  to  the  route  servers.  'he  decision  can  then 
be  cade  on  the  basis  of  the  latest  slate  information 
and  also  on  the  delays  to  be  encountered  along  the 
different  routes  { 1 ) . 

This  problem  has  been  solved  fo<  the  case  of 
deterministic  input  demaod  (7,8|,  where  it  was 
found  that  the  optimal  routing  strategy  is  a  thres- 
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hold  strategy.  The  purnoie  of  thia  paper  is  to 
investigate  the  threshold  strategy  for  a  discrete 
queue  dynamic  model  with  rap.nom  Poisoon  arrivals. 


Fig. I  A  single  queue  alternate  route  model. 


A  Threshold  Strategy  for  Local  Alternate  Routing 

Consider  any  aource  node  and  a  single  desti¬ 
nation  node  with  L  routes  between  them  indexed  by  i 
x  l,  ...,  L.  The  problem  is  to  assign  traffic  to 
these  routes  in  order  to  minimize  the  average  delay 
in  reaching  the  destination. 

We  assume  that  traff-c  arrives  at  an  average 
rate  X  with  Poisson  intersrnval  times  and  :nat  the 
length  of  each  unit  of  t.affic  is  exponentially 
distributed  with  mean  l/y.  (For  meiaages  the  units 
are  in  biti  and  for  vehicles  the  units  are  in 
feet).  All  traffic  arriving  at  the  node  ia  as¬ 
signed  to  a  (ingle  queue  and  the  decision  on  which 
route  to  use  is  made  only  when  it  reaches  tie  Croat 
of  the  queue.  We  have  then  the  situation  of  a 
single  queue  with  many  potential  servers  as  shown 
in  Fig.  I.  Letting  Q  denote  the  total  numuer  of 
customers  (messages  or  vehicles)  waiting  in  the 
queue  and  in  service,  Little's  formula  TQ  =  EIQl/.V 
gives  the  average  time  spent  on  the  queue  and  in 
service.  Furthermore  all  traffic  using  the  .  th 
route  will  encounter  ao  additional  average  delay  ot 
S  before  reaching  the  destination.  This  addi¬ 
tional  delay  includes  both  transit  delays  if  any 
and  queuing  and  service  delays  at  nodes  alon;  the 
route  to  the  destination.  The  average  total  delay 
to  reach  the  destination  is  thus  given  by 

T 1  r  *  r  *  \  3i  (1) 

1=1 


Where  A  is  tbe  average  rate  ar  which  traffic  is 
sent  along  route  i.  rot  a  detereinist  it  version  or 
this  prooiem  with  const jnt  arrival  net,  the  opti¬ 
mal  routing  strategy  was  found  to  be  a  threshold 
strategy  i  /  J  That  is  to  say  wnen  the  routes  are 
numbered  .o  that 


s,  1  s,  i  ■■  <_  SL  (2) 

then  tbe  optical  Jtrattfgy  to  me  i  :ute  i  ot  fall 


capacity  if  and  only  if  Q  exceeds  *  threshold  K 
(where  0  *  K  <  k,  <  ...  <  K,).  Tbit  means  mat  for 

K.  <  Q  <  K  ,  onl9  the  first  I  routes  should  be  used, 
i  -  r»  1 

For  very  small  Q  oaly  route  1  (the  one  with  smallest 
delsy  is  used  tod  ss  Q  incretses  the  next  best  alter- 
atte  routes  tre  tdded.  Work  is  presently  being 
carried  out  to  prove  the  optiotlity  of  this  type  of 
strategy  for  the  stochastic  problea  considered  here. 
On  a  heuristic  basis  one  would  expect  a  threshold 
strategy  to  be  optical  since  the  first  tera  of  (1)  is 
saall  when  all  routes  are  used  and  the  second  tera  is 
minimized  when  only  route  1  is  used.  For  very  saall 
values  of  Q  it  should  be  best  to  use  only  route  1  but 
wheo  Q  reaches  a  level  at  which  the  increased  cost  of 
using  the  next  best  route  2  is  oftset  by  the  aaount 
that  Q  can  be  reduced,  then  route  2  should  also  be 
used.  The  saae  rationale  for  larger  values  of  Q 
leads  to  the  threshold  strategy  outlined  above.  This 
Is  the  type  of  routing  policy  which  is  considered  in 
the  following  discussion. 

Substitute  A  *  p.  P  into  (1)  where  pi  *  the 
service  rate  for  route  \  (Vi .  *  p  C.  where  C.  is  the'' 
capacity  of  the  first  link  in  route1  i)  and  P^  is  the 
probability  that  route  i  is  used.  We  seek  to  mini- 
aixe  the  cost 


J  »  AT  a  £{Q}  ♦  2  S.  p.  P.  (3) 
ial  1  1  1 

by  choosing  the  threshold  levels  X  <  X,  <  ...  <  X.. 
Rote  that  K  a  o  because  whenever >  0  some  service 
swat  be  provided  so  route  1  (defined  as  the  route 
with  the  smallest  delay)  should  be  used.  The  rela¬ 
tion  between  the  cost  and  threshold  levels  is  com¬ 
plicated  In  the  general  case  of  L  alternate  routes  so 
we  will  proceed  with  the  case  of  l  *  2  to  demonstrate 
the  approach. 

Two  Alternate  Routes 

We  first  will  consider  a  simplified  situation 
(admittedly  unrealistic)  where  swapping(S)  is  allowed 
between  the  two  servers.  That  is,  if  server  2  is 
busy  when  server  1  completes  a  service  which  results 
in  Q  <  K  then  the  customer  being  served  fay  server  2 
has  the  service  completed  by  server  1.  This  results 
in  the  one  dimensional  state  transition  diagram  shown 
in  Fig.  2.  When  swapping  is  not  permitted  (NS)  then 
any  service  begun  by  a  server  must  be  completed  by 
the  same  server.  The  threshold  strategy  leads  then 
to  the  state  transition  diagram  of  Fig.  3  where  the 
states  tabled  (ka)  denote  that  k  customer*  are  in  the 
system  but  only  server  1  is  being  used  to  provided 


CCGOEC-  •  •  •'^OSC  •  •  • 


«  * 


Fig. 2  State  transition  diagram  for  S  mode! 
using  threshold  K. 

For  Fig.  2  we  find  that  the  S  model  is  charac¬ 
terized  by 

AP(k-l)  »  Pj  P(k)  for  1  <  k  <  K 


AP(k-l)  S  P( k  p  for  K+ 1  <  k 


where  ?(k)  £  ?rob(Q  *  k|.  Note  that  only  route  1 
(server  1)  i>  used  for  I  <  C|  <  K  whereas  both  routes 

(servers  1  and  2)  are  used  for  Q  >  K*l.  The  set  of 

equations  (A)  are  easily  solved  to  give 

Y*  P(0)  for  C  <  k  <  K 

P(k)  »  *  „  *  ~  (J> 

t2  Y,  P(0)  for  K+)  <  k 

where  y,  6  A/p}  and  Yj  $  V(Pj  ♦  Pj)-  Summing  (5) 

over  all  k  and  equating  to  1  gives  for  Yj  t  1 


(1-T,) 


h 

»-Y,  J-Tj 


where  a  £  • 

Since  route  1  la  uted  whenever  Q  >  0 

X 


Pj  «  1  -  P(0) 


for  Yj  t  1 


Equation!  (6)  and  (7)  and  those  to  follow  are 
valid  far  Y(  >  1  well  at  Yj  *■  1  provided  y2  <  1- 

Analogous  equations  for  y1  *  1  can  be  easily  ob¬ 
tained  for  the  entire  development  presented  nere. 
Route  2  is  used  only  for  Q  >  K,  so  for  Yj  F  1 

X  (Yj-OJY,* 

P,  ■  1  -  I  P(k)  «  — i - et-  (8) 


Using  E (Q }  ■  I  k?(k)  with  (3)  and  (6)  gives  for 

k*0 


Yj  F  I. 


Yi  /Y,0-Yj)  Yj  \  J  1 

1-Yj  \0-Yj)2  O-Yj)2  /  1  J 


Let  R  denote  the  routing  cost  associated  with 
(3).  Then  (7)  and  (8)  give  for  YjPl 

_  A  -  .  -  .  S  p  (Yj-aY*)*S2M2(Yj-o)Y5 

R  “  S1  *1  VS2  ^2  P2 - 7— K - 


We  observe  that  at  K  *  m,  E IQ  1  in  (9)  in¬ 
creases  monotonically  and  R  in  (10)  decreases 
sraootonicslly  so  there  is  some  value  K*  for  which  J 
*  E{Q)  ♦  R  has  a  minimum.  This  value  could  be 
found  by  a  direct  search  or  its  approximate  value 
can  be  determined  by  finding  expressions  for  the 
changes  AE(Q)  and  AR  as  functions  of  K  The 
optimum  value  K*  will  be  an  integer  near  tbe  value 
X  at  which  AE  ♦  A  R  =  0.  In  this  case  we  find. 

.  Yj«  i  *  Mj*M2 

K  ♦  s  (W(VX>  +  *  J^A 

*  1  1  1  1 1 1  i 


When  Y,  4  1  then  neglecting  the  second  term  on 
the  left  of  (Hi  gives  a  starting  point  for  a  one 


I 


(W) 


search  tor  K.  Whoa  j,  >  1  since  the 
terns  on  the  right  ere  all  negative  we  find  that 


K  > 


(x  \  x*pi 

(S,-S.)(X-M.)  ♦  -  ♦  —  —  ) - - 

,  X-M,  W^Mj-X/  Xfl  j 


1“  Y, 


(12) 


so  the  right  side  of  (12)  can  be  used  to  start  a 
one  dimensional  search  for  K*. 


Fig.  3  Slot*  transition  diagram  for  N  S 
model  with  threshold  K. 

Tor  the  NS  nodal  where  swapping  is  not  allowed 
we  find  fron  Fig.  3  that 

(s)  X  f (0)  •  ilj  P4(l)  ♦  Wj  p(l) 

(h)  A[p(k-l)+p  (k-1) J  «  M, [p(k)*p  (k) ]  ♦  p,  p(k) 
for  2  <  k  <  K  *  1  *  * 

'  ‘  03) 

k 

(c)  Xp(k)  ♦  (i,  I  f(J)  •  p  p(k*l)  for  1  <  k  <  E 

J*1 

(d)  X  p(k)  «  (M,ni2)  p(k*l)  for  Eel  <  k 
(o)  X  (p(E)  ♦  pa(K)J  -  (Mjtij)  POC*l) 


for  1  <  k  <  K 

4 

where  b.  *  1  and  v-  m  Mj/M,  •  Or  for  easier  cal¬ 
culation  define 


k 

dd  Zb.  and  iterate 
*  Jel  J 


‘k  "  -k-|  «  bJ 


(1«) 


r»l 


h  \  *  *3  dk 


for  1  <  k  <  E 


starting  with  da  ■  0,  bj  «  1.  Also  since  P(k)  *  p(k) 

*  p  (k)  ( Note  P(k)  «  p(k)  for  k  >  E»l)  eq.  (13b) 
witlf  (Id)  gives 


Kk) 

*  h 

Kk-1)  -  Tj 

bk  p(l)  for  2  «  k  <  E 

09) 

where  from  (13a) 

Kl) 

•  tj  K0)  ♦ 

(1-Y3)  p(l) 

(20) 

The 

solution  of  (19) 

can  be  written  as 

Kk) 

*  *1 

(k*l)Kl)  - 

v  fVv  (k*‘‘J>  h  1 
y3  bj^lJ 

p(l) 

for  k  ■  2,  ....  E. 

(21) 

so  by  defining 


k-1 

ft* 

J»1 


y  Ck-l-J) 
•1 


>1 


we  can  Iterate 


for  1  <  k  <  E 

(22) 


whore 

p(k)  ■  Prob  {Q  e  k  and  server  2  is  busy} 

Pa(k)  «  Prob  {Q  ■  k  and  server  2  is  not  busy} 

Eq.  (13d)  con  be  solved  to  give 

TOO  •  p(K*U  for  k  >  *»1  (14) 

where  p(E+l)  is  obtained  fron  (13e)  once  p(E)  and 
p  (E)  arc  known.  Equations  (lla-c)  give  2X  equations 
for  the  renaining  (2K+1)  unknown  P(0),  p(l),  ... 

p(X) ,  pa(l),  ...  Pa(K). 


k*l 


*1  \  *  Vl  f"  1  -  k  -  K’1 


(23) 


starting  with  a  ■  0.  Thus  we  see  that  (20)  and 
(21)  becone 


T(h)  -  y,k  P(0)  ♦  v^'^Cl-tj)  pCD-Yja^ 
for  1  <  k  <  E. 


p(l) 

(24) 


Thus  all  the  P(k)  can  be  expressed  in  terns  of 
p(l)  and  P(0).  But  p(l)  itself  can  be  expressed  io 
terns  of  P(0)  because  (13e)  and  (16)  gives 

Y2  P(E)  *  p(E+l)  *  bK+1  p(l)  (25) 


The  additional  condition  needed  to  solve  is  as 

usual 

m 

l  Kk)  «  1  (15) 

k*o 


P(E)  fron  (24)  in  (25)  gives 
E 


?Cl) 


(WV  *  *3  *K  -Y,K"l(l*Y,) 


P(0) 


BP(0) 

(26) 


An  analytical  solution  of  these  equations  leads  to 
complicated  expressions.  However  a  computational 
solution  is  easily  obtained,  since  (13c)  can  be 
solved  iteratively  to  give  p(2)  ...  p(K-l)  m  terns 
of  p(l).  That  is  if  we  define  the  coefficients  o^  by 

p(k)  $  bk  p(l)  for  1  <  k  <  Eel  (16) 
then  using  this  io  (13c)  gives 


Thus  all  t*- ;  P(k)  are  expressible  in  terns  of  P(0). 
When  these  are  inserted  into  (15)  we  find  that  for 
Y,  P  1. 

1  1 
P(0)  »  - p- 


»-Y, 


>*Y, 


♦  P 


1-Y 


(K-1) 


K-1 


»-Y, 


(l-Yji-Yj  2  »k  ♦ 


"K*l 


k*l 


Yjd-Yj.j 

(27) 
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where  0  it  defined  by  (26),  to  til  other  probabili- 
tiet  are  determined.  Here 

P»l-  P(0)  -  p(l)  *  1  -  (1*0)  P(0) 

(26) 

K  K  K 

P,  «  1  -  P(0)  -  2  p  Ik)  *1-1  P(k)  ♦  I  p(k) 

k»l  k»0  k*  1 

•ad  it  before 

m 

E(Q)  ■  I  k  Kk)  (24) 

k*0 

R  .  S1  Ml  P,  ♦  S2  Mj  P2 
eta  tbea  be  calculated. 

Example  1 

Aa  example  with  M, *  30,  p  «  10,  A»  25  «ad  S.«  1, 
S,e  2  w»*  analyzed.  The  results  arc  thowa  in  Fig.  4 
for  both  the  S  and  NS  aodels.  In  tbit  czaaple  E{Q) 
iacreaaet  stonotoaically  with  tbratbold  K  for  both 
aodcla.  For  aosc  valuet  of  K  the  NS  aodel  givei 
•  lightly  lower  value*  became  the  aacoad  server 
operate*  aore  of  the  tiae  than  for  tha  S  aodel.  The 
apparent  inconsistency  for  K*1  it  readily  ezplained 
by  the  fact  that  in  the  NS  SMdel  It  la  possible  for 
the  fatter  terver  (in  tbit  cate  aerver  1)  to  be  idle 
while  the  flower  aerver  ia  buty.  Far  low  valuet  of 
threshold  this  happens  store  often  with  the  result 
that  EIQ]  it  raised.  Exaaple  2  will  present  a  aore 
•triklag  illustration  of  this  effect  m  which  E(Q) 
will  not  be  aonotonic  but  will  actually  display  a 
aiaiaua.  As  A  -  •  both  aodelt  approach  the  fl/H/l 
case  since  the  second  server  is  not  used,  so  lia  E|Q) 
»  A/(p  -A)  *  5.  The  routiog  costs  R  decrease  aonoto- 
nically  with  threshold  K  as  the  aore  costly  route  2 
is  used  less  frequently.  As  expected  NS  has  larger 
values  for  R  than  S  does  because  NS  cannot  switch  a 
customer  being  served  by  server  2  to  server  1  When  Q 
falls  below  tha  threshold  value.  As  R-«,P.  1,  P.  * 
0  so  R  «  25  for  both  aodelt.  The  total1  cost  J2is 
shown  in  Fig.  Ab.  NS  gives  Jain  »  24.197  at  K*  ■  9 
and  S  gives  Jain  ■  29.144  at  K*  *  7  but  the  aiaiaa 
are  fairly  broad.  In  this  exaaple  using  the  simpler 
S  aodel  to  find  the  ainiaua  would  give  perforaanca 
ia^erceptably  different  than  using  the  aore  realistic 
NS  aodel. 


As  X  decreases  we  expect  that  K*  will  increase 
siace  aerver  1  can  aore  easily  serve  tbe  denied  by 
Itself  and  at  tome  point  K*  will  be  infinite  (i.e. 
it  is  best  to  uae  only  route  1).  As  X  increases 
aerver  1  requires  aore  help  in  holding  E|Q)  down  so 
r*  decreases.  In  fact  for  p}  <  A  <  Pj  ♦  p2  server  2 

oust  be  used  tone  of  the  tiae  to  keep  E)Q)  finite. 

Fig.  5  shows  the  total  deity  T  *  J/X  for  tbe  NS 
server  of  Exaaple  1  with  A  *  15,  25  and  35.  For  A  * 
15,  T  decreases  aonotonically  with  K  giving  K*  *  ■ 
and  T  ala  »  16/15  ■  1.067.  For  X  *  35  we  tee  that 
T-“*  as  !£■••  since  A  >  p  .  A  ainiaua  occurs  it  K*  *  4 
giving  T  ain  a  50.2/351*  1.434.  The  X  n  25  curve  is 
the  NS  curve  of  Fig.  4b  divided  by  25. 


Ia  Exaaple  1,  the  routs  with  fuller  routing 
delay  I,  also  bad  tbe  faster  server  (Pj  >  w, )  so  tbe 
delay  fa  service  using  server  1  and  tha  *delay  to 
reach  tbe  destination  both  (aver  route  1.  It  is 
evident  that  situations  can  ante  where  these  two 
effects  opppose  each  other  to  that  if  Pj/p,  is  large 
enough  tbea  even  if  S.  «■  S.  route  I  esn  still  be  tbe 
preferred  route  since  tie  large  delay  m  using 
server  2  outweighs  tbe  shorter  delay  is  reaching  the 
destination  along  rout*  2. 

Exaaple  2 

Mere  Pj  ■  35,  P2  ■  5.  Sj  •  1.1,  Sj  »  1  tnd  X  * 
20.  Fig.  6a  thews  E|Q)  and  R  for  tbe  NS  and  S 


COST 


models.  It  can  be  teen  that  tor  the  MS  swdel  E(CI 
ha*  a  auiwa  as  explained  above.  This  occur*  at  1  - 
6  and  five*  sin  E[Q|  3  1.2uJ.  The  routing  cost*  R 
increase  •ouotunicsiiy  with  x  here  because  Sj  >  S, . 

fig.  6b  shows  that  the  MS  node!  has  a  sioisua  at  X3  • 
2  of  J  am  3  21.116  utile  the  S  Model  has  a  aiaiaua 
of  J  min  3  22.911  at  l*  *  1. 


Example  2  Illustrates  that  the  relative  sixes  of 

S.  and  S,  alone  cannot  be  used  to  deterninc  the  pre- 
foresee  Ordering  of  the  alternate  routes.  An  exact 
deternlnation  can  be  nade  by  finding  E(QJ  ♦  R  f roe 
(29)  with  K  3  1  whan  first  one  route  and  then  the 
other  is  assianed  to  be  preferred.  The  route  yielding 
the  sutler  cost  will  then  be  the  preferred  rouce  1. 
An  approxiaate  and  more  easily  applied  test  caa  be 
used.  Suppose  only  one  cjsconer  is  id  the  systeis. 
If  route  i  is  chosen  then  the  average  delay  to  reach 
the  destination  is  l/|i  ♦  S.  since  the  first  tern  is 
the  average  delay  in  iervi^e  and  the  second  ia  the 
additional  average  delay  encountered  along  the  route. 
Thue  the  stoat  preferred  route  tabled  *1  is  the  one 
with  aaallest  S.  *  I/p.  and  all  alternate  routes  can 
then  be  ordered  “by  this1  quantity . 

Foe  the  values  of  Example  2  we  find  S.  ♦  l/p.  3 
1.13  and  Sj  ♦  l/p2  3  1.2. 

Decentralised  Routing 

The  local  routing  strategy  just  discussed  is 
well  suited  for  use  ia  a  decentralized  routing  schesie 
for  larger  networks.  Sioce  each  aode  n  n  minimising 
the  average  tine  T  that  it  takes  an  eutenng  custo¬ 
mer  to  reach  the  destination,  if  this  information  is 
eichanged  periodically  between  neighboring  nodes  it 
gives  each  node  the  intonation  needed  to  update  its 


S  values  Thus  fci  example  if  t  is  the  propaga¬ 
tion  delay  fro*  a  node  along  iont.  i  to  ne iglibor ir.g 
node  j  then  S.  3  t  »  T.  will  be  axed  in  the  local 
routemg  strategy.  1  W i c a*  each  update  a  node  would 
recalculate  tbe  threshold  values  on  whicb  its  real 
time  routing  decision  is  based  using  the  latest 
value  of  S  ,  u  and  A  These  information  updates 
would  be  aide  it  a  rate  much  slower  man  required 
for  thn  local  routing  decisions. 

Conclusion 

This  paper  suggests  am  alternate  route  strategy 
which  simply  compares  the  number  o:  customers  Q  ac  a 
■ode  against  an  optimal  threshold  value  K*.  When 
the  threshold  ia  ex'-entid  then  come  cratfic  is 

diverted  along  an  altxrnata  route.  The  real  time 
strategy  is  dynamic  because  it  adapee  its  routing  to 
the  slxe  of  Q.  It  ie  also  adaptive  in  the  sense 
that  K*  caa  be  recomputed  readily  whom  operatic* 
peraaaters  such  as  |i . ,  A  or  S,  change.  Thus  it 
saeae  Ideally  suited1  for  use  la  a  decentralised 
routing  procedure  for  large  networks  where  e  low 
rate  exchange  of  information  between  neighboring 
node*  cam  be  ueed  to  update  the  operating  thres¬ 
holds. 

IDIUKU 

1.  C.X.  Agnew  ”0n  Quadratic  Adaptive  Routing 
Algorithms",  Com.  of  ACM,  vml.  19.  Ne.  1  pp. 
11-22;  Jan.  1976. 

2.  6.  Fenchinl,  J.  Salx  "A  Reeic  Dynamic  Routing 
Problem  end  Diffusion" , IEEE  Trans  Comm.,  vol. 
CQH-2E,  Mo.  3.  pp.  321  327;  March  197S. 

3.  A.  Ephreaedes,  P  Vaiaiya,  J.  Wtirand,"  A 
Simple  Dynamic  Routing  Problem,”  IEEE  Trana 
Aut .  Control,  vol.  AC-23,  No.  6,  pp.  690-693; 
Aug. ,  19R0. 

6.  T.S.  Turn  "The  Design  and  Analysis  of  a  Sami  - 
dynamic  Deterministic  Routing  Rule,"  IEEE 
Trane.  Come.,  vol.  COM-29,  No.  6;  pp.  691-506; 
April,  1911. 

5.  T.S.  Turn,  M.  Schwartz  "The  Join-Biased -Queue 
Rule  and  it*  Application  to  Routing  in  Comnuter 
Communication  Networks,"  IEEE  Tracs.  Comm., 
vol.  COM-29,  Ko.  6;  pp.  S05-511;  April  1931. 

6.  R  Boorstyu,  A.  Livne  "A  Technique  for  Adaptive 
Rooting  m  Networks,"  IEEE  Trios.  Coosa.,  vol. 
C OH-29,  Ns.  6,  pp.  676-wBO,  April  1981 . 

7.  P.E.  Sarschik  sod  0.  Gzguner,  "On  Decentrxlized 
Routing  in  Traffic  Networks."  IEEE  Trans  lut . 
Control,  (to  appear  1912). 

8.  P.E.  Sarschik,  "Decentralized  Dynamic  C'.exrmg 
of  Congested  Mult i -Ces t in* t .on  Networks." 
Proc.  of  19th  Allertoo  Conference,  pp.  797-303, 
Oct.  1911. 


B.7  optimal  Control  of  M/M/2  Queues 


B.  Maglaris 


OPTIMAL  CONTROL  OF  M/M/2  QUEUES 


Basil  S.  Maglaris 

Polytechnic  Institute  of  New  York 
December,  1982 


Abstract 

The  optimization  of  two-server  Markovian  queues  with  unequal 
rates  of  service  is  analyzed  via  the  theory  of  Markov  decision  pro¬ 
cesses  using  an  average  expected  cost  criterion.  It  is  shown  that  the 
optimal  policy,  that  minimizes  the  steady-state  average  time  delay 
experienced  by  customers  in  the  system,  is  a  deterministic  threshold. 
Under  this  policy,  the  slower  server  is  used  whenever  the  number  of 
waiting  customers  exceeds  the  threshold  value. 


1.  INTRODUCTION 


Stochastic  optimization  of  queueing  systems  has  been  extensively 
studied  due  to  its  direct  applicability  in  diverse  engineering  and 
economic  disciplines.  Early  works  on  single  server  systems  dealt  with 
optimal  control  of  the  service  rate  in  Markovian  queues  as  in 
[CRABILL  -  72],  [ZACKS  -  70]  and  [BLACKBURN  -  72].  Optimality 
criteria  involved  service  costs/rewards,  service  rate  switching  penal¬ 
ties,  customer  delays  and  start-up/shut-down  costs.  In  [NAOR  -  69] 
the  queue  si2e  was  controlled  by  imposing  tolls  to  arriving  customers 
and  in  [KNUDSEN  -  72],  this  was  generalized  to  multiple  server 
systems.  In  several  queueing  control  problems  the  optimal  strategies 
depend  on  thresholds  on  the  queue  size.  For  excellent  reviews  on 
the  methodology  involved,  see  [PRABHU  -  73]  and  [STIDHAM  -  82], 
where  specific  hints  are  presented  on  how  to  apply  the  general  semi- 
Markov  decision  theory,  [HOWARD  -  71],  [ROSS  -  69]  to  queueing 
processes . 

A  wealth  of  applications  requiring  optimal  control  of  queues 
emerged  within  the  context  of  store-and-forward  data  communication 
networks.  Switching  nodes  in  the  network  can  be  modeled  as  queueing 
systems  with  various  streams  of  incoming  data  and  several  output 
options.  It  was  found  that  adaptive  routing  at  the  node  level  signifi¬ 
cantly  improves  the  dela>  performance  of  data  messages  as  reported 
in  [BOORSTYN  -  81],  [YUM  -81],  In  [FOSCHINI  -  78]  a  diffusion 
approximation  was  used  to  analyze  node  models  where  newly  arriving 
messages  had  the  option  to  join  queues  dedicated  to  alternate  routes. 
In  [EPHREMEDES  -  80],  techniques  from  the  optimal  control  of  queues 
were  employed  to  derive  optimal  strategies  for  the  above  node  model. 
In  [SARACHIK  -  82],  the  adaptive  routing  problem  was  formulated  as 


an  M/M/2  queue  with  different  service  rates,  and  based  on  previous 
results,  (oARACHIK  -  81]  pertaining  to  deterministic  input  models,  a 
threshold  policy  was  anticipated  to  minimize  delay.  Note  that  an 
M/M/2  queue  exhibits  lower  delay  than  two  dedicated  queues  since 
arriving  messages  in  the  latter  case  decide  on  a  route  by  estimating  a 
priori  which  server  will  become  available  first.  In  figure  1  we  illus¬ 
trate  an  M/M/2  queue  with  two  unequal  service  rates,  pa  >  .  If 
the  two  rates  are  not  substantially  asymetric,  immediate  use  of  any 
available  server  on  a  FCFS  basis  appears  to  be  an  efficient  way  to 
empty  the  system.  At  the  other  extreme,  however,  with  server  a 
many  times  faster  than  b,  it  is  reasonable  to  defer  use  of  b  unless 
the  number  of  messages  in  queue  is  large  enough  to  feed  the  fast 
server  for  the  duration  of  service  in  b.  Obviously,  the  "customer" 
in  b  receives  unfair  service  but  the  average  system  performance  im¬ 
proves.  This  is  an  illustration  of  the  discrepancy  between  "indivi¬ 
dual"  and  "social"  optimization  as  reported  in  [KNUDSEN  -  72].  In 
what  follows,  we  formulate  the  M/M/2  optimization  as  a  Markov  decision 
process  and  prove  that  the  optimal  policy  under  certain  conditions  is 
of  the  threshold  type. 

2.  MARKOV  DECISION  PROCESS  FORMULATION 

The  optimization  problem  involves  the  minimization  of  the  average 
number  of  customers  (messages)  in  the  system,  including  those  in 
service.  This  is,  by  Little's  formula,  equivalent  to  minimizing  the 

average  message  delay. 

The  state  must  indicate  the  number  of  customers  in  the  system 
and  denote  whether  each  of  the  servers  a  and  b  is  busy  servicing  a 

customer.  Let  the  state  be 
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0  Empty  system 

ia,  i  >  1  :  i  customers  in  the  system  and  only  server  a  busy 

ib ,  i  >  1  :  i  customers  in  the  sytem  and  only  server  b  busy 

i ,  i  >  2  :  i  customers  in  the  system  and  both  a  and  b  busy. 

Actions  may  be  taken  whenever  an  arrival  finds  one  or  both 
servers  empty  and  a  choice  must  be  made  whether  or  not  to  serve  or 
which  server  to  use,  and  whenever  a  departure  creates  the  opportu¬ 
nity  for  a  waiting  customer  to  proceed  to  a  server.  In  order  to  "dis¬ 
cretize"  the  continuous  time  process  defined  on  the  state  space  above, 
we  uniformize  with  respect  to  R,  the  maximum  total  rate  out  of  all 
states.  Obviously  R  =  \  +  pa  +  p^  corresponding  to  a  state  i,  with 
both  servers  busy.  For  states  ia,  ib  and  0  we  now  consider  transi¬ 
tions  at  rate  R  some  of  which  represent  arrivals  or  departures  and 
others  represent  "self  loops"  or  transitions  not  triggered  by  external 
events.  As  an  example,  from  state  ia,  i  >  2,  we  may  ha /e  the  follow¬ 
ing  transitions: 

Upon  an  arrival,  with  rats  \  ia  -*•  (i  +  l)a  OR 

ia  -*■  (i  +  1) 

Upon  a  departure,  with  rate  p  ia  -*•  (i  -  l)a  OR 

d 

ia  -*  (i  -  1)  OR 

ia  -*■  (i  -  l)b 

Remaining  transitions  with  rate  p^  la  -  la  OR 

ia  -*•  i 

This  uniformization  technique  if  suggested  in  [STIDHAM,  82]  as  a 
means  to  simplify  the  aigebra  involved  with  the  optimality  equations 
without  changing  the  steady  state  statistics  of  the  process. 


4 
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The  possible  transitions  out  of  a  state  depend  on  the  actions 
taken  at  transition  points,  whenever  there  is  a  choice.  The  action 
space  is  finite;  either  serve  the  customer  at  the  head  of  the  queue 
with  a  specific  available  server  or  not.  No  swapping  is  allowed  i.e.  a 
message  already  in  server  a  may  not  be  switched  to  server 
b.  Furthermore,  we  assume  that  if  there  are  customers  in  the  system, 
at  least  one  server  must  be  active.  This  assumption  may  be  proved 
but  we  omit  the  formalism  to  provide  a  clear  picture  of  the  state 
transitions.  In  Figure  2  we  depict  the  state  diagram  with  all  candidate 
transitions.  Solid  lines  and  circles  identify  a  set  of  actions  leading  to 
a  threshold  policy  at  i  =  3.  Under  that  policy  all  dotted  states  are 
transient  and  do  not  influence  the  steady  state  averages.  Rates  such 
as  |^,  denote  transitions  opposing  to  the  threshold  policy  but  valid 
candidates  for  an  optimal  policy. 

The  optimal  policy  n,  would  minimize  the  average  expected  cost  g 
defined  as  the  average  number  of  customers  j  under  n: 

g  =  min  (E^Cj) } 
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If  C(Xn)  denotes  the  cost  of  state  XR  at  the  n^1  period  between  the 
n^1  and  the  (n  +  l)st  transition  (with  average  dwell  time  1/R),  then 

E„<i>  =  Iim  " «Vl/xo> 

N-*»  n=0 

The  one  period  cost  C(Xn)  is  exactly  the  number  in  system  at  Xn 

C(i)  =  C(ia)  =  C(ib)  =  i 


In  order  to  use  the  results  of  the  Markov  Decision  theory  direct¬ 
ly,  we  first  limit  the  queue  capacity  (maximum  number  of  customers) 


to  a  large  but  finite  number  M.  Arrivals  into  a  full  system  are 
blocked  and  cleared.  The  minimization  criterion  involves  customers 
already  in  the  system  and  ignores  the  effect  of  blocking.  This  trun¬ 
cation  is  a  standard  technique,  [PRABHU  -  73]  to  alleviate  problems 
induced  by  unbounded  one  period  costs  and  to  handle  ergodicity 
requirements  for  average  expected  cost  minimization.  Certainly  if  M 
is  large  enough  and  the  process  stable,  truncation  is  a  good  approx¬ 
imation.  Furthermore,  it  will  be  shown  that  the  form  of  the  optimal 
policy  does  not  depend  on  M  and  thus  we  may  postulate  that  if  a 
stable  optimal  policy  exists  it  is  of  the  same  form  as  in  the  truncated 
problem. 

Under  the  modeling  assumptions  stated  above,  all  conditions  for 
the  existence  of  a  deterministic  stationary  optimal  policy  are  satisfied, 
[ROSS  -  69,  p.  149]  and  thus  the  policy  tt  minimizing  the  average 
number  of  customers  in  the  system  is  a  deterministic  assignment  of 
actions  to  states. 

3.  FORM  OF  THE  OPTIMAL  POLICY 

In  order  to  determine  the  form  of  the  optimal  policy,  we  examine 
the  equivalent  discounted  cost  problem  with  a  discount  factor  a  <  1. 
As  shown  in  [ROSS  -  69]  this  problem  can  be  transformed  into  an 
average  expected  cost  minimization  by  letting  a  •*  1 . 

Let  V(i),  V(ia),  V(ib)  denote  the  a-optimal  costs  starting  from 
states  i,  ia,  and  ib  respectively.  With  Xn  representing  the  state 
during  the  n^  period.  We  have 

00 

V(xn)  =  min  E  { [  I  a11  C(Xn)]/XnJ 
u  n  n  n=0  n 

XQ,Xn  e { i , ia , ib } ,  C(i)  =  C(ia)  =  C(ib)  =  i 


The  recursive  optimality  equations  for  the  V's  are  given  below: 


(X+VV  V(0)=a  [A*min{V(la),  V(lb)}  +  (Ma+Mb)V(0))  (1) 

(X+(Ja+Mb)  V(la)=l+or[X-min{V(2a) ,V(2)}  +  Ma  V(0)+pbV(la)]  (2) 

(A+pa+pb)V(lb)=l+a[A*min{V(2b) i v(2)i  ♦  Mb  V(0)+MaV(lb)]  (3) 

(A+u  +p.  )V(2a)=2+a(A*min{V(3a) ,  V(3) }+  u  min{V(la),V(lb)}  + 

a  D  a 

Pbmin{V(2a),V(2)}]  (4) 

(X+jj  +Mb)v(2b)=2+a[X-min{v(3b),v(3)}+Pbmin{v(la)-v(lb)J  + 

u  min{V(2b),V(2)}]  (5) 

a 

(X+Ma+Mb)V(2)=2+a[X.V(3)+Ma  V(lb)+pb  V(la)]  (6) 


(X+pa+(jb)V(i,a)=i+a[X-min{V(i+l,a),V(i+l)}+|Jamin{V(i-l,a),V(i-l,b),V(i-l)} 

+Mbmin{V(i,a),V(i)}] ,  i>2  (7) 

(X+Ma+Mb)V(i,b)=i+a[X-min{V(i+l(b),V(i+l)}+pbmm{V(i-l,b),V(i-l,a),V(i-l)} 

+u  min{V(l,b),V(i)}] ,  i>2  (8) 

a 

(X+Ma+Mb)V(i)=i+a[X-V(i+l)+Mbmin{V(i-l),V(i-l,a)} 

+(j  min{V(i-l)  ,V(i-l  ,b) }  ] ,  i>2  (9) 

a 

If  M  is  the  maximum  (blocking)  state  : 

(A+u  )V(M,a)=M+a[A*V(M,a)+u  min{V(M-l ,a) , V(M-1  ,b) , V(M-l) } 

a  D  a 

+Mbmin{V(M,a)(V(M)}]  (10) 

(X+Ma+Mb)V(M,b)=M+a[X-V(M,b)+Mbmin{V(M-l,b),V(M-l,a),V(M-l)} 

+H  min{V(Mf b) ,  V(M) }  ]  (11) 

a 

(X+pa+Mb)V(M)=M+a[X-V(Mi+Mbmin{V(M-l),  V(M-l,a)}  + 

+Mamin{V(M-l),V(M-l,b)}  (12) 

a 

Let  R  denote  the  unformization  rate: 


LEMMA  1 

The  functions  V(i),  V(ia),  V(ib)  are  increasing  in  i  and 
V(la)>  V(0),  V(lb)>V(0) ,  V(2)>V(la),  V(2)>V(lb) 

Proof 

We  will  use  induction  on  the  recursive  equations  (l)-(9).  First 
assume  that  we  start  with  a  set  of  V’s  satisfying  the  Lemma.  A 
logical  choice  is  to  set  the  cost  at  step  0  equal  to  the  one  period 
costs : 

VO  =  Voaa)  =  Vib)  =  1 

Suppose  now  that  the  lemma  holds  after  n  subsequent  applications  of 
the  recursive  equations  to  Vq: 

VnCi),  Vn(ia),  V^Cib)  are  increasing  on  i 
To  complete  the  proof  we  must  show  that  the  lemma  will  hold  at  the 
(n+T)st  step.  We  will  demonstrate  this  step  for  vn+1(0,  i>2.  From 
(9)  we  have: 

R  Vn+1  ( i+1  )=i+l+cf  f  Wn  ( i+2  )+Mbmin  { Vn  ( i ) , Vn  (i ,  a ) } + 

+Mamin{Vn(i),Vn(i,b)}] 

RVn+l(i)=i+a[XVn(i+1)+Mbmin{Vn(i'1)'Vn(i'1'a)+ 

+Mamin{Vn(i-l),Vn(i-l,b)}] 

Now,  i+l>i,  Vn(i+2)>Vn(i+l)  (by  assumption),  min  (Vn(i),Vn(i,a)}  > 
min{Vn(i-l)/Vn(i-l,a)}  and  min{Vn(i),Vn(i,b)}>min{Vn(i-l),Vn(i-l,b)} 
since  the  minimum  of  two  increasing  functions  is  increasing.  Thus 
RVn+1(i+D  >  Rvn+]/0-  Using  the  same  reasoning  it  can  be  easily 
seen  that  all  costs  V’n+1(i),  ( ia ) ,  Vn+^(ib)  are  increasing  on  i  up 

to  its  maximum  M  and  that  vn+^(2)  >  vn-fVa)  >  Vn+1(0) , Vn+^(2)  > 
V  .i(lb)  >  V„+1(0). 


LEMMA  2 


If  Ma>Mb'  v(la)<v<lb)<  and  V(i)<V(ib),  i>2 

Proof 

We  use  the  same  inductive  argument  as  in  the  proof  of  Lemma  1. 
Assuming  that  the  inequalities  hold  at  step  0  and  step  n,  we  have  at 
the  (n+l)st  step 

RVn+1(la)  =  1  +  a[A-min{Vn(2a),Vn(2)]  +  MaVn(0)  ♦  MbVn(la)} 
RVn+l(lb)  =  1  +  atxvn(2)  +  MbVn(0)  +  MaVn(lb)} 

In  order  to  show  that  Vn+1(la)<Vn+1(lb) ,  since  V^(2)  >  min  {Vn(2a), 
Vn(2)},  it  suffices  to  show  that 

MaV0)  *  %Vla>  <  Mbvn<°)  *  MaVn(lb)  or 
MaVlb)  -  MbVn(l«)  >  (Ma  -  Mb)Vn(0) 

Since  V  (lb)  >  V  (la)  (by  inductive  assumption),  p  >p.  and 
n  n  a  d 

Vn(la)>Vn(0)  (Lemma  1),  it  follows  that 

Mavn(lb)  -  Mbvn(la)>(pa  -  Mb)Vn<la»(Ma  -  pb)Vn(0) 

This  proves  the  first  argument,  V(la)<V(lb),  assuming  Vn+1(i)  < 
Vn+1(ib).  From  (5)  and  (6),  applying  the  inequalities  at  step  n,  we 
have 

RVn+l(2)  =  2  +  a£xvn(3)  +  MaVn(lb)  *  MbVn(la)] 

RVn+1(2b)  =  2  +  or[\Vn(3)  +  MbVn(la)  +  MaVn(2)] 

From  Lemma  1,  we  have  vn(2)  >  Vn(lb),  thus  Vn+^(2)  <  Vn+1(2b) 
From  (8)  and  (9),  we  have  that  for  i>2 

RVn+l(i)=i+a[XVn(i+1)+Mbmin{Vn(i'1)'Vn(i'1'a)}+MaVn(i'1)] 

RVn+1(ib)ai+0f^Vn(i+l)+kibmin{Vn(i-l,a),Vn(i-l)]+paVn(i)] 

and  since  V  (i)>V  (i-1)  it  follows  that  V  ,  (i)<V  ,(ib). 


Note  that  we  make  use  of  arguments  from  Lemma  1  in  the  induc¬ 
tive  proof.  This  is  valid  since  the  two  Lemmas  are  consistent  and 
may  start  from  the  same  initial  set  of  V's.  Alternatively,  one  can 
combine  the  two  Lemmas  into  a  single  inductive  proof. 


COROLLARY  1 

States  ib,  i>l  are  transient  under  the  optimal  policy. 

Proof 

Observing  the  optimality  equations,  we  notice  that  under  Lemma 
2,  there  is  no  path  from  states  (i)  and  (ia)  into  any  state  (ib),  i>l. 
State  (lb)  may  be  reached  upon  departure  from  2.  This  statement 
conforms  with  the  intuitive  argument  that  there  is  no  reason  to  use 
the  slow  server  when  the  fast  server  is  idle. 

LEMMA  3 

If  V(i-l,a)>V(i-l),  i-l>l,  then  V(j,a)>V(j)  for  all  j  >  i 

Proof 

Assume  that  in  the  optimal  policy,  there  exists  a  state  i-1,  such 
as  V(i-l,a)  >  V(i-l).  In  this  case,  we  will  use  the  inductive  procedure 
again  but  now  keeping  the  costs  of  states  (i-1, a)  and  (i-1)  fixed  to 
their  optimal  values.  The  iterative  application  of  the  optimality  equa¬ 
tions  to  the  remaining  costs,  will  converge  by  contraction  to  their 
optimal  values.  The  a-priori  knowledge  of  V(i-l,a)  and  V(i-l)  will 
reduce  the  number  of  unknowns,  and  if  our  guess  is  correct,  it 
just  creates  two  redundant  consistent  equations. 

At  step  0  we  set  V  (i-1  ,a)=V(i-l ,a)  and  VQ(i-l)=V(i-l).  The 
remaining  VQ's  are  set  to  arbitrary  initial  values  satisfying  the  Lemma 
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and  the  monotonicity  property  of  Lemma  1,  in  reference  to  V(i-l,a) 
and  V(i-l).  It  can  be  easily  seen  that  the  monotonicity  property  will 
hold  throughout  the  subsequent  operations,  by  translating  the  proof 
of  Lemma  1,  to  incorporate  the  fixed  values  V(i-l,a)  and  V(i-l). 

Assume  that  at  step  n  the  Lemma  holds.  Remember  that  no 
change  is  allowed  for  the  cost  of  states  (i-2,a)  and  (i-1).  From  (7) 
and  (9),  we  have 

RVn+l(i'a)=i+a[XVn(i+1)+MaVn(i'1)+MbVn(i)]  *  Vn(i-l)=V(i-l) 
RVn+l(i)=i+0,[AVn(i+1)+MbVn(i'1)+>JaVn(i'1)1 

and  since  Vn(i)>Vn(i-l) ,  it  follows  that  vn+i(*<a)>vn+l(i) •  The  same 
equations  apply  for  all  j  >  i.  At  j=M,  Vn(j+1)  is  replaced  by  Vn(M) 
and  similar  inequalities  hold.  This  completes  the  proof  of  the  Lemma. 


THEOREM  la 

The  a-optimal  policy  for  the  finite  M/M/2  queue,  defers  service 
from  the  slow  server  until  the  number  of  waiting  customers  exceeds  a 
threshold  value. 

Proof 

From  Lemma  3,  it  follows  that  for  the  least  K  >  1  for  which 
V(K+l,a)>V(K+l),  all  states  (ia),  i>K,  will  exhibit  greater  costs  V(ia) 
than  V(i).  Thus  new  arrivals  and  departures  will  favor  states  (ia), 
for  i  <K  and  states  (i)  for  i>K.  This  threshold  K  will  render  all 
states  (ia),  i>K,  transient  along  with  states  (ib),  i>l  (Corollary  1 
The  threshold  value,  depends  on  the  parameters  A,jj  It.  may  b»* 

that  for  Ma>>M^),  the  threshold  K  approaches  the  limit  of  the  queue 
size  in  which  case,  we  never  use  the  slow  server  b.  Equivalently  if 
Ma/Hb  Is  dose  to  unity,  the  threshold  moves  to  1  which  is  exactly  an 
M/M/2  queue  with  arrivals  on  empty  state  routed  to  server  a. 
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THEOREM  lb 

The  optimal  average  expected  cost  policy  for  the  finite  M/M/2 
queue  is  a  deterministic  threshold  as  in  Theorem  la. 

Proof 

For  the  finite  state  problem,  state  (0)  is  recurrent  for  all  deter¬ 
ministic  policies  considered  in  the  or-optimal  recursive  equations  (1)- 
(12).  Furthermore  the  one  period  cost  i  is  bounded  by  its  maximum 
M  and  hence  all  conditions  for  the  existence  of  optimal  average  expected 
cost  policies  are  satisfied  as  in  [ROSS  -  70],  Corollary  6.20.  The 
structure  of  the  optimal  policy  n  will  be  the  same  as  in  the  discounted 
case  and  the  optimal  average  expected  cost,  g=En(i)  is  given  by 

g  =  lim  (l-a)V  (0) 
or->l  a 

The  truncation  of  Lie  state  space  was  introduced  to  guarantee 
that  the  one  period  costs  are  bounded,  and  that  state  0  is  recurrent 
under  all  policies.  However,  since  the  threshold  structure  does  not 
depend  on  the  maximum  queue  size  M,  it  can  be  concluded  that  if  a 
stable  optimal  policy  exists  to  the  infinite  state  problem,  it  is  of  the 
threshold  type. 

4.  DISCUSSION  AND  NUMERICAL  EXAMPLES 

In  the  previous  section  we  proved  that  the  optimal  policy  mini¬ 
mizing  the  average  queue  length  (or  average  delay)  is  of  the  thresh¬ 
old  type.  Obviously  the  criterion  involves  "social  optimality"  from  the 
system  point  of  view.  Individual  customers  (messages)  would  never 
chose  the  slow  server  regardless  of  the  state  of  the  system  unless 
their  service  time  on  the  slow  server  is  less  than  the  sum  of  their 


service  time  in  the  fast  server  and  the  waiting  time  before  being 
served  by  the  fast  server.  The  optimum  threshold  value  depends  on 
the  parameters  of  the  system,  namely  the  ratio  of  the  service  rates 
and  the  total  system  utilization.  In  order  to  illustrate  this  dependence 
we  solved  the  equilibrium  equations  in  an  infinite  size  system  for 
various  thresholds,  service  rate  ratios  and  traffic  intensities.  In 
Figure  3,  we  depict  the  average  queue  length  at  traffic  intensity 
\/(^a+Pj3)=0.5.  As  expected  if  server  a  is  much  faster  than  b  c.39  to 
1),  the  optimal  threshold  becomes  very  large,  eventually  leading  to  a 
single  server  queue  with  the  slow  server  never  used.  Conversely  for 
relatively  balanced  rates  (2  to  1  or  equal),  both  servers  are  always 
used  as  in  an  uncontrolled  M/M/2  system  (preference  is  given  to  a  if 
both  a  and  b  are  idle).  For  rate  ratios  3  to  1  and  7  to  1  there 
exists  clearly  an  optimal  threshold  at  2  and  4  respectively.  A  con¬ 
sequence  of  using  thresholds  is  that  we  obtain  unbalanced  traffic 
volumes  routed  via  a  and  b.  In  Figure  4,  we  illustrate  the  percentage 
of  the  traffic  routed  via  a,  100  x  \  /\.  Notice,  that  even  with  bal- 

a 

anced  rates  1  to  1  and  K=1  (pure  M/M/2  system),  more  traffic  is 
routed  to  a  than  b,  due  to  decisions  taken  upon  arrivals  to  an  empty 
system.  As  the  threshold  increases,  certainly  the  \  /\  ratio  ap- 
proaches  100%.  This  observation  led  to  the  suggestion  of  threshold 
policies  in  adaptive  routing  schemes,  where  the  traffic  served  via 
each  outgoing  link  (server)  is  constrained  to  a  value  determined  by 
some  global  criterion.  In  [MAGLARIS  -  83),  we  formulated  this 
problem  and  proved  that  threshold  policies  minimize  average  queueing 
delays  with  constrained  output  flows. 
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Finally,  in  Figure  5,  we  demonstrate  the  effect  of  the  traffic 
intensity  on  the  optimum  threshold  value.  As  intuitively  expected, 
queues  dose  to  saturation  do  not  have  the  luxury  of  threshold  poli¬ 
cies  and  must  operate  both  servers  at  full  capacity.  Lightly  utilized 
queues  do,  however,  exhibit  improved  performance  under  threshold 
values  which  increase  as  the  utilization  drops .  Depending  on  the 
Pa:Pb  rati°'  a  system  utilized  at  low  levels  would  involve  thresholds 
approaching  a  positive  integer  close  to  p  /p,  -  1.  As  an  example, 

for  Pa:Pb  =  7,  and  very  low  traffic  intensity,  the  optimal  policy  uses 
the  slow  server  only  if  7  messages  have  been  accumulated,  i.e.  the 
threshold  value  is  6. 


/ 
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ABSTRACT 

We  consider  several  versions  of  the  Knapsack  Problem  and  show 
that  they  can  be  solved  using  a  shortest  path  algorithm  requiring  both 
storage  and  running  time  which  are  polynomial  functions  only  of  the 
number  of  types  of  object  and  the  size  (weight)  of  a  single  object. 
This  is  a  significant  improvement  over  previous  algorithms  for  the 
solution  of  the  Knapsack  Problem  using  shortest  paths,  which  typically 
require  storage  and  runtime  which  are  functions  of  the  total  knapsack 
size. 

I.  INTRODUCTION 

The  Knapsack  Problem,  which  is  interesting  in  its  own  right,  has 

T31 

also  received  much  attention  recently1  1  as  a  means  of  solving  integer 
programming  problems  via  a  relaxation  technique  using  group  theory. 
Formally,  the  Knapsack  Problem  can  be  stated  as: 

M 

Maximize  z  =  l  p.x. 

i=l  1  1 


♦This  work  was  partially  supported  by  NSF  under  Grant  ENG  7908120 
and  by  U.S.  Army  CECOM  under  Contract  DAAK-80-80-K-0579. 


M 

subject  to  =  I  w.x.  =  W 
i=l  1  1 

Xj  >  0  for  i  =  1 , 2 ....  M 
x.  integer 

Thus,  we  are  given  M  types  of  object  with  weight  w.  and  profitability 
p.  per  object  of  type  ..  We  are  required  to  maximize  the  total  profit 
subject  to  a  constraint  that  the  sum  of  the  weights  of  the  objects 
selected  equals  W.  In  the  simple  (unbounded)  version  of  the  problem 
there  is  no  restriction  on  the  number  of  objects  of  each  type  which 
may  be  included  in  the  solution. 

Sometimes  an  inequality  constraint  is  used  in  place  of  the  equal¬ 
ity  constraint,  i.e.,  one  seeks  objects  whose  aggregate  weight  does 
not  exceed  W.  Such  a  problem  can  be  transformed  into  a  problem 
with  an  inequality  constraint  by  including  an  additional  object  with 
unit  weight  and  zero  profitability.  Here  we  will  concentrate  primarily 
on  problems  with  equality  constraints. 

We  assume  that  W  and  the  w.  are  non-negative  integers  (or, 
more  generally,  that  they  are  commensurate)  and  that  the  Pj  are 
non-negative. 

Previous  approaches  to  the  solution  of  this  problem  include  those 
reported  by  Horowitz  and  Sahni^  using  dynamic  programming  and 
implicit  enumeration.  These  approaches  have  been  found  to  be  effec¬ 
tive  for  a  wide  range  of  problems  but  have  exhibited  excessive  run¬ 
times  for  problems  with  a  large  number  of  nearly  equally  valuable  (in 
terms  of  cost  per  unit  weight)  objects.  Also,  their  worst  case  run¬ 
ning  times  are  exponential  in  the  number  of  object  types.  Denardo 
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and  Fox1  1  have  developed  an  approach  which  overcomes  these  ob¬ 
jections  by  solving  the  problem  as  a  shortest  path  problem  in  a  net¬ 
work  with  W  nodes  and  W  x  m  arcs.  Thus,  their  approach  requires 
storage  proportional  to  W  and  runtime  proportional  to  W  x  m.  If  W 
and  M  are  sufficiently  small,  their  approach  is  extremely  attractive. 
If  W  is  large,  however,  the  storage,  and  to  a  lesser  extent  the  run¬ 
time,  become  prohibitive. 

Denardo  and  Fox  present  their  algorithm  in  the  context  of  using 
a  Knapsack  Problem  as  an  integer  programming  relaxation.  In  such 
cases,  W  is  generally  of  the  same  order  as  the  w..  We  extend  their 
approach  to  the  case  where  W  is. very  large  but  the  individual  w^  are 
not,  and  exploit  the  underlying  cyclic  group  structure  of  the  problem 
to  develop  an  algorithm  with  both  runtime  and  storage  requirements  a 
function  ■  only  of  the  weight  of  a  single  object  and  the  number  of 
object  types.  Garf inkle  and  Nemhauser^  point  out  the  relationship 
to  the  group  structure  but  do  not  discuss  application  of  the  structure 
to  develop  an  algorithm  as  efficient  as  the  one  presented  here. 

II.  BASIC  PROCEDURE 

For  the  sake  of  clarity,  we  begin  by  describing  the  basic  proce¬ 
dure  and  justifying  it.  In  later  sections,  extensions  and  refinements 
of  this  procedure  are  considered. 

We  begin  by  defining  a  network  K  =  (N,A,L)  corresponding  to 
the  Knapsack  Problem  (KP).  The  node  set,  N,  contains  W  +  1  nodes 
numbered  0  through  W  corresponding  to  the  aggregate  weight  of  the 
objects  selected  thus  far.  At  each  node,  i,  there  are  outgoing  arcs 


corresponding  to  feasible  objects  which  may  be  selected  to  augment  a 
partial  solution  with  weight  i.  Thus,  at  node  i  there  will  be  arcs 
(i,  i  +  wK)  for  all  K  such  that  i  +  w^  <  W.  The  length  of  an  arc 
corresponding  to  an  object  of  type  K  is  p^.  We  thus  have: 

N  =  {0,1,...  W} 

A  =  {(i,  i  +  w^)!  i  =  0,1,... W,  for  all  k  such  that  i  +  w^  <  W} 

L  +  {1H  !  (i,j)eA,  where  1,.  =  p.  for  arcs  corresponding  to  tyr 
k  objects} 


The  network  thus  contains  approximately  MW  arcs.  We  will  assume 

that  wv  f  w.  for  kt  f  k2  as  if  two  types  of  objects  have  the  same 
K1  k2 

weight  the  one  with  the  smaller  profit  can  be  ignored.  We  also  assume 
w^  >  0  and  p^  >  0  for  all  k.  We  will  refer  to  arcs  corresponding  to 
a  type  k  object  simply  as  type  k  arcs.  Figure  1  illustrates  part  of  a 
network  corresponding  to  a  KP  with  4  types  of  object,  described  in 
the  table  accompanying  Figure  1.  For  the  sake  of  clarity,  only  nodes 
0  through  7  and  arcs  from  nodes  0  and  1  are  shown  in  Figure  1. 

Paths  from  node  0  to  node  W  in  such  networks  correspond  to 
solutions  of  the  associated  KP.  The  length  of  a  path,  defined  as  the 
sum  of  the  lengths  of  the  arcs  in  the  path,  is  the  total  profit  of  the 
solution.  Thus,  the  longest  path  corresponds  to  the  optimal  solu¬ 
tions.  Note  that,  by  the  assumption  that  w^  >  0  for  all  k,  the  net¬ 
work  is  acyclic  and  so  the  problem  of  finding  the  longest  path  from  0 
to  W  is  solvable  using  Bellman's  shortest  path  algorithm  suitably  modi¬ 
fied  to  find  longest  paths.  Defining  d.  to  be  the  length  of  the  cur¬ 
rent  estimate  of  the  longest  path  from  node  0  to  node  i,  this  amounts 
simply  to: 

that  w.  f  w.  for  Ki  ^  K2  as 
k  1  k2 
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Algorithm  1 


Step  0:  d0  <-  0 

d.  «-  »  i  =  1, . . .  .W 
Step  1:  For  i  =  1  to  W 

For  all  j  such  that  (i,j)eA 
(dj  =  max  (dj,dj  +  ljj) 

One  can  also  keep  track  of  the  solution  itself  by  setting  a^  <-  i  when¬ 
ever  the  value  of  dj  is  updated.  At  the  end  of  the  algorithm  one  can 
trace  the  path  back  from  node  W  to  node  0  using  the  aj.  We  refer  to 
dj  as  the  label  on  node  j  and  to  aj  as  the  predecessor  of  node  j. 
Step  1  is  known  as  a  scan  of  each  node,  i. 

Algorithm  1  is  the  simplest  possible  shortest  path  algorithm. 
The  nodes  are  scanned  in  numerical  order  and  each  node  is  scanned 
only  once.  The  algorithm  has  a  running  time  on  the  order  of  MW 
operations  and  a  storage  requirement  on  the  order  of  W.  If  W  is 
very  large,  the  approach  becomes  unattractive.  The  approach  des¬ 
cribed  below  avoids  this  problem  by  solving  the  problem  using  a 
network  which  is,  in  general,  much  smaller.  The  key  to  reducing  the 
size  of  the  network  is  embodied  in  the  following  lemmas.  Define 

w  =  max  (w,  )  and  w*  =  w.  such  that  =  max  k.  Thus  w  is  the 
k  *  J  w  k  wk_ 

weight  of  the  "heaviest'7  type  of  object  and  w*  is  the  weight  of  the 
"best"  type  of  object. 

The  proofs  of  some  of  the  lemmas  rely  upon  elementary  proper¬ 
ties  of  cyclic  groups.  The  interested  reader  is  referred  to  [6]  for 
more  details. 
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Lemma  1:  An  optimal  KP  solution  exists  containing  no  set  of 


objects  whose  aggregate  weight  is  a  multiple  of  w*. 

Proof:  Given  an  optimal  KP  solution  containing  a  set  of  objects 
of  aggregate  weight  nw*,  we  can  replace  these  objects  by  n  "best" 
objects  without  altering  the  total  weight.  The  net  change  is  profit, 
Ap  is  defined  by: 

Ap  =  np*  -  I  p. 

j  J 

f  p*  -  Pj> 

By  the  definition  of  p*,  each  term  of  the  sum  is  non-negative 
and  hence  Ap  >  0.  We  can  thus  replace  the  set  without  destroying 
optimality  and  the  lemma  is  proven. 

The  terms  in  the  above  sum  can  be  interpreted  as  the  amount  of 

profit  "wasted"  by  using  a  type  j  object  instead  of  a  best  object.  We 

can  in  fact  redefine  the  arc  lengths  in  the  network  corresponding  to 

a  given  KP  to  be  precisely  these  quantities.  The  optimal  KP  solution 

then  corresponds  to  the  shortest  path  from  0  to  W.  This  can  be  seen 

to  be  true  by  observing  that  the  above  transformation  of  the  arc 

lengths  affects  the  lengths  of  all  paths  from  0  to  W  in  precisely  the 

same  way.  If  the  length  of  a  0  to  W  path  in  the  original  network  was 

w 

p  the  length  of  the  same  path  in  the  new  network  is  p*  —  *  -  p 
Thus  the  relative  length  of  all  0  to  W  paths  are  reversed.  Alterna¬ 
tively,  one  can  observe  that  the  optimal  path  is  the  one  which  wastes 
the  least  profit.  In  the  sequel,  we  use  this  form  of  the  length  func¬ 
tion  as  it  will  allow  us  to  improve  the  efficiency  of  the  algorithm.  We 
refer  to  the  network  using  this  length  function  as  Kj. 
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In  Figure  1,  the  new  lengths  of  arcs  of  types  1  through  4  are 
1,2,0,  and  2,  respectively.  Note  that  type  3  arcs  are  best  and  that 
best  arcs  have  0  length.  All  arcs  have  non-negative  length.  If  the 
best  type  of  arc  is  not  unique  (e.g.  if  w2  =  9  in  the  above  example), 
it  is  possible  for  the  length  of  more  than  one  type  of  arc  to  be  0. 
To  avoid  confusion,  we  assume  that  the  best  type  of  object  is  unique. 
The  extension  of  the  algorithms  to  this  case  is  straight  forward. 

Lemma  2:  An  optimal  solution  exists  containing  no  more  than 
w*  -  1  non-best  objects. 

Proof:  Any  set  of  w*  or  more  objects  must  contain  a  subset  of 
weight  nw*  for  some  n  >  0.  To  see  this,  consider  a  set  of  w*  ob¬ 
jects,  ilfi2, . .  .L,*.  From  the  sums 

W 

i 

S.  =  (  I  W.  )  modulo  W* 

]  .  k=l  1  k 

i 

If  S.  =  0  then  1  W.  =  nw*. 

]  k=l  1  k 

£ 

If  S.  =  S0  for  £  >  j  then  I  W.  =  nw*. 

J  £  k=j  1  k 

But  there  are  w*  Sj's  and  only  w*  -  1  nonzero  values  of  modulo 

w*.  Hence,  either  S.  =  0  or  S.  =  S.  for  some  £  and  the  set  contains 

J  J 

a  subset  of  weight  nw*.  From  the  proof  of  Lemma  1  we  see  that  we 
can  replace  any  such  subset  by  n  best  elements  without  decreasing 
the  profit.  Thus,  we  need  only  consider  sets  containing  w*  -  1  or 
fewer  non-best  elements  together  with  zero  or  more  best  elements. 
An  optimal  solution  can  always  be  obtained  in  this  way. 

But  since  all  elements  have  weight  no  greater  than  w,  the  ag¬ 
gregate  weight  of  the  non-best  elements  in  any  such  optimal  solution 


will  not  exceed  W*(w*  -  1).  If  W  >  Ww*,  we  need  not  consider  W 
explicitly  as  a  constraint.  We  need  simply  apply  the  above  algorithm 
to  get  dj  for  j  <  ww*,  and  then  complete  the  solution  by  adding  the 
appropriate  number  of  best  objects. 

Indeed,  we  are  only  interested  in  the  weight  modulo  w*  of  a 
partial  solution  since  any  two  solutions  whose  weights  differ  by  a 
multiple  of  w*  differ  only  by  one  or  more  best  objects.  We  thus 
have  : 

Lemma  3:  If  W  >  ww*,  the  optimal  KP  solution  can  be  found  by 
adding  2ero  or  more  best  objects  to  the  set  of  objects  defining  the 
shortest  path  in  the  network  formed  by  merging  all  nodes  with  the 
same  weight  modulo  w*  as  described  above. 

Proof :  Consider  the  original  network  before  the  nodes  were 

merged.  Consider  the  0  to  W  path  corresponding  to  an  optimal  solu¬ 
tion.  By  Lemma  1,  we  may  assume  this  path  contains  no  set  of  arcs 
(correspond  to  objects)  of  weight  nw*  except  for  best  arcs.  Since 
the  network  contains  paths  corresponding  to  picking  these  objects  in 
all  possible  orders,  we  may  assume  that  the  path  under  consideration 
has  all  the  best  arcs  at  the  end.  Consider  only  the  initial  subpath 

T. 

V 

corresponding  to  the  non-best  arcs. 

Now,  consider  what  the  transformation  does  to  this  initial  sub¬ 
path.  All  the  nodes  in  this  path  have  different  weights  modulo  w*. 
Thus  the  transformation  changes  this  optimal  0  to  W  path  in  Kx  to  a  0 
to  W'  path  in  the  network  with  merged  nodes,  where  W'  =  W  modulo 
w*.  Note  that  other  paths,  including  some  optimal  ones,  become 
complex  paths  containing  cycles  because  they  contain  subsets  whose 
aggregate  weights  are  a  multiple  of  w*.  The  path  we  are  focusing  on 


however  remains  a  simple  path  and  can  thus  be  found  using  an  ordi¬ 
nary  shortest  path  algorithm. 

No  new  paths  are  created  by  the  transformation.  Thus,  an 
optimal  KP  solution  can  be  obtained  by  appending  0  or  more  best  arcs 
to  the  shortest  0  to  W'  path  in  the  new  network,  and  the  lemma  is 
proven. 

It  is  thus  possible  to  define  a  new  network 
K2  =  (N,  A,L): 

N  =  {0,1, . .  .w*-l} 

A  =  {(i,j)  |  j  =  (i  +  wfc)  modulo  w*  for  some  K} 

w 

L  =  I (i,j)e  A,  =  —  p*  -  pt  for 

lJ  w*  K 

k  defining  (i,j)} 

This  network  has  only  w*  nodes  instead  of  W.  Furthermore,  we 

note  that  if  (W.  =  W,  )  modulo  w*,  then  type  kx  and  type  k2  arcs 
K1  k2 

are  parallel  and  the  arc  corresponding  to  the  object  with  the  smaller 
Jl.j  can  be  ignored.  Thus,  there  is  at  most  1  outgoing  arc  from  each 
node  to  each  other.  The  total  number  of  arcs  is  thus  at  most 
w*(w*  -  1).  We  have  thus  removed  dependency  on  both  M  and  W  and 
have  a  procedure  whose  running  time  and  storage  are  a  polynomials 
only  in  w*. 
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ALGORITHM  DESCRIPTION 

We  now  describe  the  actual  procedure  in  more  detail  and  analyze 


its  storage  and  runtime. 


■* 


The  following  is  th*  ^s^nce  of  the  procedure- 
Step  0:  (Initialization) 
d0  =  ° 

d.  =  ®  j  =  1 , . . .  w*-l 

where  dj  is  the  current  estimate  of  the  length  of  the  shortest  patn  ^ 

from  0  to  j. 


Step  1:  Find  i,  the  best  node  to  scan  next.  This  is  discussed  in 

detail  below. 

Step  2:  (Scan  node  i) 

For  k  =  2,  . . .  M 

Let  j  =  (i  +  w^)  modulo  w* 

If  dj  >  dj  +  then  dj  =  d.  +  and  PIT  =  k 
where  is  the  length  of  an  arc  corresponding  to  a  type  k  object 

=  w,  (p*/w*  -  p.j  as  defined  above)  and  PR  records  the  type  of 

1;  K.  ft.  j 

tne  arc  used  to  label  node  j  and  hence  keep  track  of  the  optimal 
solution. 

Step  3:  Return  to  Step  1  if  any  nodes  remain  to  be  scanned;  other¬ 
wise  stop. 

It  is  clear  that  the  storage  required  for  this  procedure  is  linear 
in  w*  +  M.  If  Step  1  is  sufficiently  simple  and  each  node  is  only 
scanned  once,  the  procedure's  runtime  will  be  dominated  by  Step  2 
and  will  be  proportional  to  Mw*.  This  is  in  fact  the  case,  as  we  will 
see. 

A  node,  j,  must  be  scanned  if  its  label,  d-,  is  improved  (re¬ 
duced)  as  it  may  then  improve  the  labels  of  other  nodes.  If  we  are 
to  ensure  that  each  node  is  scanned  at  most  once,  we  must  defer 
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scanning  it  until  its  iabel  cannot  be  improved.  When  all  arc  lengthb 
are  positive,  as  they  are  here,  this  objective  is  achieved  by  scanning 
the  nodes  in  ascending  order  of  d. .  This  is  Dijkstra's  well  known 
shortest  path  algorithm. 

In  the  general  case,  Dijkstra's  algorithm  requires  that  in  Step  1 
we  find  the  as  yet  unscanned  node  with  the  smallest  label.  If  this 
were  done  naively,  it  would  require  an  examination  of  w*  labels  and 
the  overall  procedure  would  require  (w*)2  +  Mw*  operations.  In  this 
case,  however,  we  can  implement  Step  1  carefully  and  do  much  bet¬ 
ter. 

We  form  "buckets"  of  nodes  to  be  scanned  and  place  nodes  to  be 
scanned  together  in  the  same  bucket  if  they  have  similar  labels.  If 
the  width  of  the  buckets  is  q,  then  we  place  a  node  with  label  d, 
where  (k-l>  <  d  <  kq  in  the  kth  bucket.  We  then  simply  work  our 
way  through  the  buckets  in  ascending  order  of  bucket  number  scan¬ 
ning  the  nodes  in  each  bucket  in  arbitrary  order.  Thus,  there  is  no 
explicit  search  in  Step  1.  If  the  width  of  the  bucket  is  no  greater 
than  the  length  of  the  shortest  arc  in  the  graph,  then  any  node 
labeled  by  the  node  currently  being  scanned  will  reside  in  a  higher 
numbered  bucket  and  hence,  each  node  will  be  scanned  at  most  once. 

The  only  problem  with  this  procedure,  which  is  well  known,  is 
that  the  number  of  buckets  itself  may  become  very  large,  in  particu¬ 
lar  much  larger  than  the  number  of  nodes,  if  the  longest  arc  is  many 
times  greater  than  the  shortest.  In  this  case,  a  great  deal  of  storage 
and  running  time  may  be  wasted  in  dealing  with  empty  buckets.  We 
now  show  that  in  this  case  fewer  than  w*  buckets  are  required  in  ail 


cases . 


We  begin  by  reindexing  the  arcs  out  of  each  node  corresponding 
to  the  M  -  1  remaining  types  of  object  in  increasing  order  of  length. 
(Recall  that  the  arc  corresponding  to  the  best  type  of  object  was  re¬ 
moved  from  explicit  consideration.)  Thus,  Li,  is  the  length  of  the 
arc  which  corresponds  to  the  second  best  type  of  object,  etc.  The 
w^  and  pj  are  reindexed  correspondingly.  We  will  refer  to  arcs  of 
length  L.,  weight  Wj,  and  profit  p.  after  this  reindexing  as  arcs  of 
type  i  (or  as  arcs  corresponding  to  objects  of  type  i) .  For  sim¬ 
plicity,  we  will  assume  Lj  +  L.  for  i  f  j  although  the  procedure  does 
not  require  this. 

Consider  the  case  where  w*  is  relatively  prime  to  wx.  By  the 
definition  above,  arcs  of  length  Lt  are  the  shortest  arcs  in  the 
graph.  Since  wx  and  w*  are  relatively  prime,  a  path  containing  at 
most  w*  -  1  arcs  comprised  entirely  of  arcs  of  type  1  exists  from  0  to 
each  other  node.  Thus,  we  know  that  no  node  need  have  a  label 
greater  than  Lx(w*  -  1).  So,  w*  -  1  buckets  of  width  Lx  suffice 
since  the  shortest  path  will  be  no  longer  than  the  aforementioned  one. 

In  the  case  where  w*  and  wx  are  not  relatively  prime,  the  above 
path  will  loop  back  to  node  0  before  reaching  many  of  the  other 
nodes.  In  this  case,  the  following  preliminary  computation  is  carried 
out  before  setting  up  the  buckets: 

r  =  w* 

For  j  =  1,  M-l 

g.  =  r/GCD(r,w.) 
r  =  GCD(r,Wj) 

where  GCD(i,j)  is  the  greatest  common  divisor  of  the  integers  i  and  j 
and  can  be  found  using  Euclid's  Algorithm,  whose  running  time  is 
bounded  by  the  following  Lemma: 


Lemma  4:  Euclid's  Algorithm  has  a  worst  case  running  time  ot 
order  log  N  when  applied  to  find  the  GCD  of  N  and  M(  for  N  >  M. 

Proof :  For  N  >  M,  Euclid's  Algorithm  replaces  a  problem  on  N 

and  M  by  one  one  M  and  N  -  qM.  The  worst  case  for  this,  i.e.,  the 
one  which  converges  most  slowly,  is  when  q  =  1  at  all  stages.  This 

results  in  a  Fibonacci  sequence.  It  is  well  known  [1]  that  the  K  ‘ 

Fibonacci  number,  F^  is  given  by: 


Thus,  the  runtime  of  the  algorithm  is  of  order  K  for  N  of  order 
F^,  i.e.,  K  is  of  order  log  N. 

We  observe  that,  corresponding  to  the  paths  comprised  entirely 
oi  type  1  arcs  above,  there  is  now  a  tree  rooted  at  C,  spanning  all 
the  nodes,  and  containing  paths  with  at  most  g.  -  1  arcs  of  length 
Lj.  An  illustration  of  such  a  tree  is  given  in  Figure  4.  In  this  case 
only  nodes  0i8,  and  4  can  be  reached  using  only  arcs  of  type  1 

before  the  path  cycles  back  to  node  0.  The  number  of  nodes  so 

reachable  is  gx  (three  in  this  case).  The  original  set  of  w*  nodes  is 
partitioned  into  gx  parts  which  are  then  each  partitioned  into  g2 
smaller  parts  corresponding  to  the  nodes  which  are  now  reachable  via 
arcs  of  type  2.  The  partitioning  continues  until  all  nodes  are 

reached.  The  above  can  be  proven  rigorously  in  the  general  case 
using  the  properties  of  cyclic  groups. 

We  now  define  buckets  of  variable  width  corresponding  to  the 

longest  path  in  the  tree  defined  above.  Thus,  there  are  gx  -  1 
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buckets  of  width  lt  fuliowed  by  g2  -  1  buckets  of  width  12,  etc.  A 
total  of  B  =  £  (g.  -  1)  buckets  is  required.  Since  the  product  of  the 
g.'s  is  w*f  B  <  w*. 

We  have  shown  that  less  than  w*  buckets  are  required.  We  now 
show  that  no  node  is  scanned  more  than  once.  To  do  so,  we  show 
that  the  width  of  each  bucket  is  no  greater  than  the  length  of  the 
shortest  arc  still  permitted  in  a  path. 


Figure  4:  A  spanning  tree  bounding  the  lengths  of  shortest  paths 

Note  that  the  paths  from  0  to  j  correspond  to  ordered  selections 
of  objects  of  aggregate  weight  j;  i.e.,  there  are  distinct  paths  cor¬ 
responding  to  the  selection  of  the  same  objects  in  different  orders. 
We  can,  and  should,  consider  only  one  path  corresponding  to  each 
distinct  collection  of  objects.  This  is  easily  done  by  keeping  track  of 
the  last  type  of  arc  in  the  path  (which  we  are  doing  with  the  variable 


PRj)  and  only  considering  arcs  with  the  same  or  higher  index  (object 
type).  Thus,  we  will  find  solutions  corresponding  to  a  selection  of 
objects  in  non-decreasing  order  of  type. 

Now,  consider  a  node  in  bucket  i  of  width  w. .  This  node  was 
either  reached  via  a  path  in  the  tree  considered  above  or  via  an  arc 
whose  length  is  greater  than  that  of  any  in  that  tree.  In  latter  case, 
the  length  of  the  smallest  arc  under  consideration  exceeds  the  width 
of  any  bucket.  In  the  former  case,  since  we  are  considering  arcs  in 
increasing  order  of  length,  again  a  node  can  only  enter  a  bucket  via 
an  arc  at  least  as  long  as  the  width  of  the  bucket. 

Extensions 

If  W  <  w*w  then  it  is  possible  for  Wj  to  be  equal  to  Wj  and  for 
objects  i  and  j  not  to  dominate  one  another.  In  particular,  if  Wj  >  w^. 
and  £i  <  £.  neither  i  nor  j  may  dominate.  This  is  because  the  con¬ 
straint  on  W  may  prohibit  the  use  of  a  sufficient  number  of  objects 
with  the  smaller  length.  As  an  example  of  this,  consider  a  problem 
with  W  =  24  and  (W.^)  of  (5,50),  (6,58),  (11,109),  (1,0).  Here, 

the  are  0,2,1,  and  10,  and  w2  =  w3  modulo  5.  Neither  object  2 
nor  object  3  dominates  the  other,  however.  In  fact,  the  optimal 
solution  is  4  type-2  objects  in  this  case.  For  W  =  21  or  W  =  22, 
however,  type-3  objects  would  be  used  in  the  optimal  solution. 

For  W  =  23,  both  type-2  and  type-3  objects  would  be  used.  Lemma 

3  can  thus  not  be  extended  directly  to  the  case  whose  W  <  w*w. 

Thus,  it  is  not  possible  to  make  the  final  reduction  to  a  network 
with  w*  nodes  in  this  case.  However,  when  the  individual  w.  are 
small,  which  is  the  principal  case  of  interest  here,  w*w  will  be  small 
as  well.  The  above  algorithm  can  then  still  be  applied  effectively  * 
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the  network  with  W  nodes.  Alternatively,  one  can  view  the  following 
procedure  as  working  on  the  reduced  graph  with  w*  nodes  but  allow¬ 
ing  multiple  labels  on  each  node.  We  adopt  this  latter  point  of  view 
in  describing  the  modified  procedure  below. 

A  label  on  a  node,  i,  is  now  a  couple  (d.,  c.)  where  d.  is  the 
amount  of  wasted  profit  to  get  to  node  i,  i  =  1,2,...  w*-l,  and  c^  is 
the  cycle  number  of  this  label.  The  c.  are  defined  by  the  relation 
i  =  j  -  w*c. 

where  j  is  the  node  in  the  network  Ki  (with  W  nodes)  which  would 

have  received  the  same  label.  A  node  i  in  K2  can  thus  have  several 

labels  corresponding  to  different  c^,  but  for  any  two  labels  (d^,cp 

and  (d-',cp  on  the  same  node  i, 

d.'  >  d.  -*■  c'  <  c- 
1111 

since  as  we  noted  above  the  only  reason  for  considering  a  larger 
would  be  to  obtain  a  smaller  c-.  Similarly,  for  any  pair  of  objects 
with  weights  having  the  same  residue  modulo  w*  the  object  with  the 
larger  weight  must  have  the  smaller  length. 

There  are  several  limitations  on  the  maximum  number  of  objects 
of  any  given  type.  In  particular,  m.,  the  maximum  number  objects  of 
type  i,  is  bounded  by 


m.  <  min 


W 


w. 

1 


w* 


i-1 


'  GCD(w*,r.) 

where  r-  is  the  residue  of  w^  modulo  W*.  The  first  term  on  the  right 
hand  side  follows  from  the  fact  that  any  larger  number  of  type  i 
objects  have  weight  greater  than  W.  The  second  term  follows  by  the 
some  reasoning  is  Lemma  2. 
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Proceeding  along  the  lines  similar  to  those  in  [3]  we  create 
(log2  M.)  +  1  objects  corresponding  to  1,2, 4, 8,...  and  2^  objects  of 
type  i,  where  b  is  largest  power  of  2  that  is  less  than  M- .  We  thus 
now  have  defined  a  new  problem  with  at  most  M  log2  W  objects.  At 
most  one  object  of  each  type  is  permitted  in  the  solution. 

Proceeding  along  lines  similar  to  those  in  [3],  we  can  create 
(log2  M.)  +  1  objects  corresponding  to  1,2, 4, 8... 2^  objects  of  type 
i,  where  b  is  the  largest  power  of  2  that  is  less  than  M. .  We  then 
have  a  problem  with  at  most  M1  =  Mlog2(w*)  objects  where  at  most 
one  object  of  each  type  need  be  considered.  Thus,  we  could  create  M1 
buckets  and  proceed  with  the  algorithm  as  above. 

One  can  refine  this  procedure  by  eliminating  dominated  objects 
from  consideration.  In  particular,  if  two  objects  i  and  j,  have  the 
same  residue  modulo  w*,  and  w;  >  w.  and  £.  <  £.  then  object  j  may  be 
eliminated  from  further  consideration.  One  must  be  careful,  however, 
not  to  allow  one  copy  of  an  object  to  eliminate  another  copy  of  the 
same  object. 

For  example  suppose  there  were  initially  4  types  of  object  with 
(Wi.Pi)  equal  to  (10,20),  (3,5),  (7,4)  and  (1,0)  W  =  10  and  W  <  ww*, 
respectively.  If  W  =  37  we  must  consider  the  modified  procedure. 
We  thus  create  objects  with  (W^P.)  equal  to  (6,10),  (12,20),  and 
(6,10)  corresponding  to  2,4,  and  2  objects  of  type  2.  These  three 
new  objects  together  with  the  original  object  of  type  2  can  be  used  to 
select  anywhere  from  0  to  9  copies  of  the  type  2  object.  The  first 
new  object  is  identical  to  the  third  and  would  dominate  it  unless  we 
specifically  prohibited  this. 
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If  the  W.  and  P.  are  drawn  independently  from  uniform  distribu¬ 
tions,  this  latter  refinement  will  reduce  the  average  number  of  objects 
to  no  more  than  w*£n(M)  objects  and  buckets  as  we  see  from  Lemma 
5. 

Lemma  5:  There  will  be  on  the  average  on  the  order  of  no  more  than: 


W*  £n 


M  log?(W*) 
W* 


objects  left  undominated  after  dominated  objects  are  removed  in  the 
above  procedure. 

Proof:  As  observed  above,  there  are  approximately  Mlog2(W*)  or 

fewer  objects  before  dominance  is  checked.  Suppose  these  objects 
divide  evenly  into  residue  classes.  There  would  then  be  Mlog2(w*)/w* 
objects  in  each  residue  class. 

An  object  can  be  dominated  by  any  other  object  in  the  same 
class.  If  the  objects  are  ordered  in  increasing  order  of  £.,  an  object 
will  be  dominated  unless  its  w.  is  less  than  the  w.  of  all  objects  pre¬ 
ceding  it  on  the  list  since  the  weights  are  uniformly  distributed,  the 
item  in  the  list  will  be  undominated  with  probability  Thus, 
given  a  list  of  length  L  the  expected  number  of  undominated  elements 
remaining  on  the  list  is 
L 

1  l 

j=l  1  2n(L) 

The  result  follows  directly  by  substituting  for  L.  If  the  residue 
classes  do  not  all  contain  the  same  number  of  elements,  the  result  is 
still  true  since  we  observe  that  the  quantity 
£n(Li)  +  £n(L2)  for  Lx  +  L2  =  2L 
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is  maximized  for  Lx  =  L2  =  L,  that  is,  equal  sized  residue  classes 
yield  the  maximum  number  of  objects. 

Thus,  even  for  W  <  w*w,  we  have  a  number  of  buckets  polynomial 
in  w*,  not  W.  The  running  time  is  thus  at  worst  of  orderWw*  <  (w*)2w. 

It  is  possible  to  extend  the  procedure  still  further  to  take  into 
account  restrictions  on  the  number  of  permissible  objects  of  each 
type,  for  some  or  all  of  the  objects.  First,  if  one  is  given  an  a 
priori  restriction,  U.,  on  the  number  of  objects  of  type  i,  one  need 
only  set 

M.  =  Min  (M.,U.) 

where  is  the  maximum  number  of  objects  of  type  is  permitted  and 
the  M-  on  the  right  hand  side  is  computed  as  above.  If  the  a  priori 
maximum  permissible  number  of  best  objects  is  less  than  W/w*  than  a 
fictitious  best  object  with  w*  =  W  +  1  must  be  added  to  the  problem. 
This  maintains  the  validity  of  the  procedure  but  may  reduce  its 
efficiency  considerably  if  the  original  w*  was  much  smaller  than  W. 


CONCLUSIONS 

We  have  presented  an  algorithm  for  the  solution  of  the  simple 
(unbounded)  Knapsack  Problem  whose  running  time  and  storage  are 
functions  only  of  the  size  of  the  "best"  object  weight,  VI*  in  the  case 
where  the  overall  weight  constraint  is  sufficiently  large.  In  the  cases 
where  the  overall  weight  constraint  is  smaller  or  where  restrictions 
exist  on  the  number  of  objects  of  a  given  type,  we  present  an  exten¬ 
sion  of  this  procedure  which  in  practice  is  often  very  efficient  and 
whose  worst  case  running  time  is  no  greater  than  (w*)2w  where  w  is 
the  weightiest  object. 
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ABSTRACT 

We  consider  the  simple  (unbounded)  Knap¬ 
sack  Problem  and  show  that  it  can  be  solved 
using  a  shortest  path  algorithm  requiring  both 
storage  and  running  time  which  are  polynomial 
functions  only  of  the  number  of  types  of  object 
and  the  size  (weight)  of  the  single  largest 
object.  This  is  a  significant  improvement  over 
previous  algorithms  for  the  solution  of  the 
Knapsack  Problem  using  shortest  paths,  which 
typically  require  storage  and  runtime  which  are 
functions  of  the  total  knapsack  size. 

INTRODUCTION 

The  Knapsack  Problem,  which  is  interest¬ 
ing  in  its  own  right,  has  also  received  much 
attention  recently  as  a  means  of  solving  integer 
programming  problems  via  a  relaxation  technique 
using  group  theory.  Formally,  the  Knapsack 
Problem  can  be  stated  as: 

M 

Maximize  z  =  Z  p,x, 
i=l  11 

M 

subject  to  Z  w.x,  «  W 
i=l  11 

Xj  >  0  for  i=l,2,...M 
Xj  integer 

Thus,  we  are  given  M  types  of  object  with 
weight  w.  and  profitability  pi  per  object  of  type 

i.  We  are  required  to  maximize  the  total  profit 
subject  to  an  constraint  that  the  sum  of  the 
weights  of  the  objects  selected  equals  W.  In 
the  simple  (unbounded)  version  of  the  problem 
which  we  consider  here,  there  is  no  restriction 
on  the  number  of  objects  of  each  type  which 
may  be  included  in  the  solution. 

Sometimes  an  inequality  constraint  is  used 
in  place  of  the  equality  constraint  considered 
here;  i.e.,  one  seeks  objects  whose  aggregate 
weight  does  not  exceed  W.  Such  a  problem  can 
be  transformed  into  a  problem  with  an  inequal¬ 
ity  constraint  by  including  an  additional  object 
with  unit  weight  and  zero  profitability.  Here 
we  mil  concentrate  primarily  on  problems  with 
equality  constraints. 

We  assume  that  W  and  the  w.  are  non- 
negative  integers  (or,  more  generally1,  that  they 
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are  commensurate)  and  that  the  p.  are  non¬ 
negative. 

Previous  approaches  to  the  solution  of  this 
problem  include  those  reported  by  Horowitz  and 
Sahni  [1]  using  dynamic  programming  and  impli¬ 
cit  enumeration.  These  approaches  have  been 
found  to  be  effective  for  a  wide  range  of  prob¬ 
lems  but  have  exhibited  excessive  runtimes  for 
problems  with  a  large  number  of  nearly  equally 
valuable  (in  terms  of  cost  per  unit  weight) 
objects.  Also,  their  worst  case  running  times 
are  exponential  in  the  number  of  object  types. 
Denardo  and  Fox  [2]  have  developed  an  ap¬ 
proach  which  overcomes  these  objections  by 
solving  the  problem  as  a  shortest  path  problem 
in  a  network  with  W  nodes  and  W*M  arcs. 
Thus,  their  approach  requires  storage  propor¬ 
tional  to  W  and  runtime  proportional  to  WxM.  If 
W  and  M  are  sufficiently  small,  their  approach  is 
extremely  attractive.  If  W  is  large,  however, 
the  storage,  and  to  a  lesser  extent  the  runtime, 
become  prohibitive. 

Denardo  and  Fox  present  their  algorithm  in 
the  context  of  using  a  Knapsack  Problem  as  an 
integer  programming  relaxation.  In  such  cases, 
W  is  generally  of  the  same  order  as  the  Wj.  We 

extend  their  approach  to  the  case  where  W  is 
very  large  but  the  individual  Wj  are  not,  and 

exploit  the  underlying  cyclic  group  structure  of 
the  problem  to  develop  an  algorithm  with  both 
runtime  and  storage  requirements  a  function  only 
of  the  weight  of  the  largest  single  object  and  the 
number  of  object  types. 

BASIC  PROCEDURE 

For  the  sake  of  clarity,  we  begin  by  des¬ 
cribing  the  basic  procedure  informally.  Exten¬ 
sions,  refinements  and  a  more  formal  description 
are  given  in  later  sections. 

Given  a  particular  Knapsack  Problem  (i.e., 
given  values  for  W,  M  and  the  pairs  (Pj.Wj), 

i=l,...M)  we  can  construct  a  graph,  G,  with 
nodes  J=0,1,...W  and  directed  arcs  e=(i,ra) 
where  m^i+w^  for  some  k.  There  is  an  arc  in  G 

corresponding  to  each  i<W  and  each  k<M  pro¬ 
vided  ra<W.  Figure  1  shows  such  a  graph  for 
M=3,  W=7,  and  the  (pp  Wj)  =  (3,2),  (5,3), 

(10,5).  The  length  of  each  arc  (shown  next  to 
the  arc)  is  the  profit  obtained  by  selecting  the 
type  of  object  corresponding  to  the  arc.  Node 
numbers  correspond  to  weight  used  up  by  the 
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objects  already  selected  a  path  from  node  0  to 
node  |  corresponds  to  a  s<  lection  ol  obiecls  with 
aggregate  weight  )  and  the  length  of  the  path 
(the  sum  ol  the  lengths  of  the  arcs  in  the  path) 
corresponds  to  the  profit  obtained  There 
exists  a  path  corresponding  to  any  leasible 
selection  of  objects  and  hence,  the  longest  path 
from  0  to  W  in  G  corresponds  to  the  optimal 
solution  to  the  Knapsack  Problem.  This  is  the 
essence  of  the  shortest  path  technique  de¬ 
scribed  by  Denardo  and  Fox  among  others. 

It  is  possible  to  accomplish  the  same  opti¬ 
mization,  however,  using  a  graph  containing  a 
much  smaller  number  of  nodes.  Johnson  (31 
observed  that  it  is  not  in  general  necessary  to 
label  all  nodes  in  order  to  obtain  the  optimim 
solution,  but  rather,  that  one  may  stop  when 
node  j=w*w  has  been  reached,  where  w=  max 

(Wj)  and  p*/w*=  max  (p./w.)  ;  i.e.,  v9  is  t Ae 
)  j  J  I 

weight  of  the  "heaviest"  object  and  w*  is  the 
weight  of  the  "best"  object.  We  will  explore 
this  observation  and  exploit  it  in  developing  a 
new,  more  efficient  algorithm. 

The  key  to  proving  the  validity  of  the 
above  observation  is  that  if  an  arbitrary  solu¬ 
tion  contains  one  or  more  objects  whose  aggre¬ 
gate  weight  is  a  multiple,  say  q,  of  w*,  then  a 
new  solution  no  worse  than  the  first  can  be 
obtained  by  replacing  these  objects  by  q  best 
objects.  Such  a  replacement  leaves  the  total 
weight  unaltered  and  cannot  decrease  the  total 
profit.  Thus,  we  need  only  consider  partial 
solutions  which  do  not  contain  any  objects 
whose  aggregate  weight  is  a  multiple  of  w* 
The  appropriate  number  of  best  objects  can  be 
added  to  complete  any  such  solution  at  the  end 
of  the  procedure. 


There  can  be  at  most  w*-l  objects  in  any 
such  partial  solution.  To  see  this,  consider  a 
partial  solution  containing  b  objects  with  weights 

w(  1 ) ,  w<2),  ...  w(b).  Form  the  sums  Sj=(Iw(i>) 

'  i=l 

modulo  w*.  The  Sj  must  all  be  different  and 

must  all  be  integers  in  the  range  1  to  w*-l. 
Otherwise,  the  b  objects  contain  a  subset  of 
«Q9r*9ate  weight  qw*  since  : 


and 


J 

s,  =  0  ■*  2w(i)  =  qw* 

1  1=1 

k 

s,  =  su  •*  2w(i)  =  qw*  for  j  <  k 
>  K  i=j+l 


Note  that  it  is  in  fact  possible  for  such  a 
partial  solution  to  contain  w*-3  objects,  for 
instance  w*-l  objects  with  weight  w.  when  w^ 

and  w*  are  relatively  prime  (i.e.,  contain  no 
common  factors  greater  than  1),  so  this  bound 


on  t he  maximum  number  of  objects  is  tight  at 
least  in  some  cases. 

Since  vi  is  the  weight  of  the  largest  ob- 
ieri,  Johnson's  observation  is  proven.  Thus, 
one  could  rejilace  G  by  another  graph  with  a 
similar  structure  but  with  only  w(w*-l)*l  nodes 
and  thus  improve  both  the  memory  and  runtime 
performance  of  the  procedure  if  the  new  number 
of  nodes  is  less  than  W. 

Using  similar  reasoning,  we  go  substantially 
further.  First,  we  redeline  the  lengths  of  the 
arcs  in  G  above  to  represent  the  amount  of 
wasted  profit  relative  to  a  (not  necessarily 
realizable)  path  comprised  entirely  of  best  arcs. 
Note  that  this  redefinition  of  arc  lengths  re¬ 
verses  the  previous  ordering  of  the  lengths  of 
different  paths  between  nodes  0  and  any  other  j. 
The  optimal  solution  is  thus  now  the  shortest 
path  from  0  to  W  in  the  new  G,  with  arc  lengths 
given  by  w^  (pVw*  -  p^)  for  arcs  which  cor¬ 
respond  to  type  k  objects.  Note  that  all  arcs 
have  non-negative  length  and  best  arcs  are  the 
only  ones  that  have  zero  length.  For  simplicity, 
we  will  assume  that  the  best  arc  type  is  unique; 
the  case  of  multiple  best  arc  types  is  in  fact 
easy  to  handle  in  the  procedure. 

We  now  note  that  every  pair  of  nodes  j  and 
j*w*  in  G  are  connected  by  a  zero  length  arc 
and  hence  that  if  a  path  of  length  L  from  0  to  j 
exists  then  a  path  of  length  L  from  0  to  j-*w* 
exists  too.  For  w  sufficiently  large,  in  parti¬ 
cular,  for  W>w*w,  it  is  sufficient  to  consider  a 
new  graph.  H,  with  only  w*  nodes  corresponding 
to  the  original  node  numbers  modulo  w*.  This 
gives  rise  to  a  graph  of  the  type  shown  in 
Figure  2  for  the  same  example  giver,  in  Figure  1 
except  that  W  is  now  assumed  to  be  much  larger. 
(This  graph  is  in  fact  a  Cayley  color  graph  for 
the  cyclic  group  of  order  w».) 


Figure  1.  A  graph  for  Figure  2  Cyclic  graph 
solving  the  for  Knapsack 

Knapsack  Problem 

Problem 
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Note  that  this  graph  contains  arcs  both 
from  higher  numbered  nodes  to  lower  numbered 
nodes  and  vice  versa,  unlike  G  which  contained 
only  arcs  from  lower  numbered  nodes  to  higher 
numbered  nodes.  Also,  note  that  arcs  appear  as 
zero  length  self  loops  and  can  thus  be  ignored. 
Since  all  remaining  arcs  are  of  positive  length, 
all  cycles  are  of  positive  length  and  the  shortest 
path  problem  is  well  defined.  Thus,  we  have 
almost  reduced  the  problem  to  one  of  finding  a 
shortest  path  in  a  graph  with  w*  nodes  instead 
of  W  nodes,  the  resultant  path  augmented  by  the 
appropriate  number  of  best  arcs  to  form  the 
optimal  solution. 

The  only  loose  end  to  tie  up  is  what  to  do 
in  the  case  where  W  <  w*W.  In  this  case,  it  is 
possible  that  the  shortest  path  found  in  H  above 
may  correspond  to  a  path  to  a  node  greater  than 
W;  i.e.,  the  "appropriate"  number  of  best  arcs 
is  negative.  Put  another  way,  the  reduction 
from  G  to  H  was  based  on  the  assumption  that 
node  j  is  dominated  by  node  j+w*  in  G  since  the 
lower  numbered  node  could  always  extend  its 
paths  through  the  higher  numbered  node  via  a 
zero  length  arc.  This  is  not  true,  however,  if 
J*w*  >  W-W  since  a  path  to  W  may  exist  from  the 
lower  numbered  node  but  not  from  the  higher 
numbered  node. 

We  can  handle  this  problem  by  adding  an 
additional  W  nodes  to  H,  corresponding  to  the 
nodes  W-W+l,  ...  W  in  G  and  adding  arcs  from 
appropriate  nodes  0.  ...  w*-l  to  these  new 
nodes  corresponding  to  each  type  of  object. 
Finally,  we  add  arcs  between  lower  numbered 
nodes  and  higher  numbered  nodes  correspond* 
ing  to  each  type  of  object,  exactly  as  in  G. 
This  entire  transformation,  which  at  first  ap¬ 
pear  complex,  amounts  simply  to  collapsing  G 
by  removing  the  nodes  in  the  range  w*  through 
W-W  since  the  removed  nodes  are  not  essential 
to  finding  the  optimal  solution. 

Figure  3  shows  part  of  such  a  graph  in  a 
case  where  w**5,  W=6,  and  W*22.  Shown  are 
only  the  arcs  corresponding  to  an  object  with 
weight  2.  Thus,  even  with  this  complication, 
both  the  runtime  and  storage  have  been  re¬ 
duced  to  a  polynomial  in  M,  w*.  and  W;  i.e., 
there  is  no  functional  dependence  on  W.  Note 
that  if  W  <  w**W  then  we  simply  solve  the 
problem  using  G;  no  reduction  is  necessary. 

In  the  sequel,  we  will  limit  the  discussion 
to  the  case  where  W  >  w*W.  The  case  where  W 
<  w*W  follows  by  the  above  transformation, 
adding  storage  and  runtime  proportional  to  W. 
The  final  phase  of  the  procedure  in  this  case  is 
an  ordinary  shortest  path  algorithm  where 
nodes  w*  through  w*+W  are  each  scanned  se¬ 
quentially. 

ALGORITHM  DESCRIPTION 

We  now  describe  the  actual  procedure  in 
more  detail  and  analyze  its  storage  and  runtime. 


Figure  3.  A  graph  for  solving  the  Knapsack 
Problem  when  W  <  w*  w 

The  following  is  the  essence  of  the  procedure: 
Step  0:  (Initialization) 
dj,  *  ° 

dj  *  •  i=l , . . .  w*-l 

where  dj  is  the  current  estimate  of  the 
length  of  the  shortest  path  from  O  to 

J. 

Step  1:  Find  i,  the  best  node  to  scan  next. 

This  is  discussed  in  detail  below. 

Step  2:  (Scan  node  1) 

For  k=2,  ...  M 

\ 

Let  J=(i+wk)  modulo  w* 

If  dj>dj  ♦  Lk  then  d.=dj  ♦  Lk  and 
PRj*k 

where  Lk  is  the  length  of  an  arch 

corresponding  to  a  type  k  object 
(Ljc=wk(p*/w*  -  p^)  as  defined  above) 

and  PRj  records  the  type  of  the  arc 

used  to  label  node  J  and  hence  keep 
track  of  the  optimal  solution. 

Step  3:  Return  to  step  1  if  any  nodes  remain 
to  be  scanned;  otherwise  stop. 

It  is  clear  that  the  storage  required  fqr 
this  procedure  is  linear  in  w*+M.  If  Step  1  is 
sufficiently  simple  and  each  node  is  only  scanned 
once,  the  procedure's  runtime  will  be  dominated 
by  Step  2  and  will  be  proportional  to  Mw*.  This 
is  in  fact  the  case,  as  we  will  see. 
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A  node.  ).  must  be  scanned  if  its  label. 
d(.  is  improved  (i educed)  as  it  may  then  im¬ 
prove  the  labels  of  other  nodes.  If  we  are  to 
ensure  that  eacn  node  is  scanned  at  most  once, 
we  must  deler  scanning  it  until  its  label  cannot 
be  improved  When  all  arc  lengths  are  posi¬ 
tive.  as  they  are  here,  this  objective  is  achiev¬ 
ed  by  scanning  the  nodes  in  ascending  order  of 
d..  This  is  Dijkstra's  well  known  shortest  path 

algorithm. 

In  the  general  case,  Dijkstra's  algorithm 
requires  that  in  Step  1  we  find  the  as  yet  un¬ 
scanned  node  with  the  smallest  label.  If  this 
were  done  naively,  it  would  require  an  examina¬ 
tion  of  w*  labels  and  the  overall  procedure 
would  require  (w*)2  ♦  Mw*  operations.  In  this 
case,  however,  we  can  implement  Step  1  care¬ 
fully  and  do  much  better. 

We  form  "buckets"  of  nodes  to  be  scanned 
and  place  nodes  to  be  scanned  together  in  the 
same  bucket  if  they  have  similar  labels.  If  the 
width  of  the  buckets  is  q,  then  we  place  a  node 

with  label  d,  where  (k-1)  <  d  <  kq  in  the  k1^ 
bucket  We  then  simply  work  our  way  through 
the  Duckets  in  ascending  order  of  bucket  num¬ 
ber  scanning  the  nodes  in  each  bucket  in 
arbitrary  order.  Thus,  there  is  no  explicit 
search  in  Step  1.  If  the  width  of  the  bucket  is 
no  greater  than  the  length  of  the  shortest  arc 
in  the  graph,  then  any  node  labeled  by  the 
node  currently  being  scanned  will  reside  in  a 
higher  numbered  bucket  and  hence,  each  node 
wiil  be  scanned  at  most  once. 

The  only  problem  with  this  procedure, 
which  is  well  known,  is  that  the  number  of 
buckets  itsell  may  become  very  large,  in  par¬ 
ticular  much  larger  than  the  number  of  nodes, 
if  the  longest  arc  is  many  times  greater  than 
the  shortest.  In  this  case,  a  great  deal  of 
storage  and  running  time  may  be  wasted  in 
dealing  with  empty  buckets.  We  now  show  that 
in  this  case  fewer  than  w*  buckets  are  required 
in  all  cases. 

We  begin  by  reindexing  the  arcs  out  of 
each  node  corresponding  to  the  M-l  remaining 
types  of  object  in  increasing  order  of  length, 
(hecall  that  the  arc  corresponding  to  the  best 
type  of  arc  was  removed  from  explicit  consid¬ 
eration.)  Thus,  Lj ,  is  the  lenth  of  the  arc 

wr'‘ic"  corresponds  to  the  second  best  type  of 
object,  etc  The  vc  and  Pj  are  reindexed 

correspondingly  We  will  refer  to  arcs  of 
length  L(,  weight  w. ,  and  profit  p;  after  this 

reindexing  as  arcs  of  type  i  (or  as  arcs  cor¬ 
responding  to  objects  of  type  i).  Tor  sim¬ 
plicity,  we  wili  assume  Lj  ¥  Lj  for  i*j  although 

the  piocedure  does  not  require  this. 

Consider  the  case  where  w*  is  relatively 
prime  to  Wj .  by  the  definition  above,  arcs  of 


length  I.,  are  the  shortest  arcs  in  the  graph. 

and  w»  are  relatively  prime,  a  path 

bincc  w l 

containing  at  most  w*-l  arcs  comprised  entirely 
of  arcs  ol  type  1  exists  from  0  to  each  other 
node.  Thus,  we  know  that  no  node  need  have  a 
label  greater  th3n  Lj(w*-1).  So.  w*-l  buckets 

of  width  Lj  suffice  since  the  shortest  path  will 

be  no  longer  than  the  aforementioned  one. 

In  the  case  where  w*  and  w.  are  not  rela¬ 
tively  prime,  the  above  path  will  loop  back  to 
node  0  before  reaching  many  of  the  other  nodes. 
In  this  case,  the  following  preliminary  computa¬ 
tion  is  carried  out  before  setting  up  the  buck¬ 
ets: 

r=w* 

For  j=l.  M-l 

gj=  r/GCD(r,Wj) 
r=GCD(r,Wj) 

where  GCD(i,j)  is  the  greatest  common  divisor  of 
the  integers  l  and  j  and  can  be  found  using 
Euclid's  Algorithm. 

We  observe  that,  corresponding  to  the 
paths  comprised  entirely  of  type  1  arcs  above. 
there  is  now  a  tree  rooted  at  0,  spanning  all  the 
nodes,  and  containing  paths  with  at  most  g .-1 

arcs  of  length  Lj.  An  illustration  of  such  a  tree 

is  given  in  Figure  4.  In  this  case  only  nodes 
0,8,  and  4  can  be  reached  using  only  arcs  of 
type  1  before  the  path  cycles  back  to  node  0. 
The  number  of  nodes  so  reachable  is  gj  (three 

in  this  case).  The  original  set  of  w*  .nodes  is 
partitioned  into  gj  parts  which  are  then  each 

partitioned  into  g0  smaller  parts  corresponding 

to  the  nodes  which  are  now  reachable  via  arcs  of 
type  2.  The  partitioning  continues  until  all  nodes 
are  reached.  The  above  can  be  proven  rigor¬ 
ously  in  the  general  case  using  the  properties  of 
cyclic  groups. 

We  now  define  buckets  of  variable  width 
corresponding  to  the  longest  path  in  the  tree 
defined  above  Thus,  there  are  gj-1  buckets  of 

width  Lj  followed  by  g-,-1  buckets  of  width  L2, 
etc.  A  total  of  B=  1  ( g(-l )  buckets  is  required. 

Since  the  product  of  the  g . 's  is  w*.  B  <  w*. 

We  have  shown  that  less  than  w*  buckets 
are  required.  We  now  show  that  no  node  is 
scanned  more  than  once.  To  do  so,  we  show 
that  the  width  of  each  bucket  is  no  greater 
than  tne  length  of  the  shortest  arc  still  permit¬ 
ted  in  a  path. 
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Figure  4:  A  spanning  tree  bounding  the 
lengths  of  shortest  paths 

Note  that  the  paths  from  0  to  j  correspond 
to  ordered  selections  of  objects  of  aggregate 
weight  j;  i.e.,  there  are  distinct  paths  cor¬ 
responding  to  the  selection  of  the  same  objects 
in  a  different  order.  We  can,  and  should,  con¬ 
sider  only  one  path  corresponding  to  each  dis¬ 
tinct  collection  of  objects.  This  is  easily  done 
by  keeping  track  of  the  last  type  of  arc  x  in 
the  path  (which  we  are  doing  with  the  variable 
PR.)  and  only  considering  arcs  with  the  same  or 
higher  index  (object  type).  Thus,  we  will  find 
solutions  corresponding  to  a  selection  of  objects 
in  non-decreasing  order  of  type. 

Now,  consider  a  node  in  bucket  i  of  width 
Wj.  This  node  was  either  reached  via  a  path  in 
tHe  tree  considered  above  or  via  an  arc  whose 
length  is  greater  than  that  of  any  in  that  tree. 
In  latter  case,  the  length  of  the  smallest  arc 
under  consideration  exceeds  the  width  of  any 
bucket,  in  the  former  case,  since  we  are  con¬ 
sidering  arcs  in  increasing  order  of  length, 
again  a  node  can  only  enter  a  bucket  via  an  arc 
at  least  as  long  as  the  width  of  the  bucket. 

CONCLUSIONS  AND  REFINEMENTS 

The  basic  procedure  as  stated  is  seen  to 
have  a  storage  requirement  proportional  to 
w*+w+M  in  the  worst  case  and  runtime  propor¬ 
tional  to  (w*+w)M  in  the  worst  case.  This  is  a 
significant  improvement  over  procedures  of  this 
type  currently  in  use.  Several  refinements  can 
be  made  to  the  basic  procedure,  however,  to 
improve  its  performance  still  further  in  specific 
cases. 

If  Wj=Wj  snd  Pj<Pj,  object  1  can  be  eliminat¬ 
ed  from  consideration.  In  fact,  if  wi  and  wj  are 
congruent  to  each  other  modulo  w*  and  P^Pj, 

object  i  can  be  eliminated  from  consideration  in 
paths  between  nodes  i  and  j  in  the  range  0  to 
w*-l  (.but  not  other  nodes  in  the  case  W<w*W.) 


Notr  thui  at  most  w*-l  types  ol  ans  can  thus 
remain  beiwo-n  the  nodes  0  and  v»*-l  and  both 
the  storage  and  running  time  become  (unctions 
only  of  w*.  not  w  or  M  (In  the  rase  of  in¬ 
equality  constraints,  this  dominance  can  be 
pushed  still  farther  and  object  i  can  be  eliminat¬ 
ed  if  w(  j  Wj  and  p(  <  . ) 

Since  the  shortest  path  to  any  node  is 
bounded  by  v-  (g(-l  )L. ,  when  scanning  a  node 

with  label  b,  arcs  of  length  greater  than  v-b 
can  be  eliminated  from  consideration.  Since  arcs 
are  scanned  in  ascending  order  of  length,  this  is 
easily  implemented  by  terminating  the  scan  when 
the  first  such  arc  is  encountered. 

We  are  currently  carrying  out  computational 
experiments  to  compare  the  efficiency  of  this 
procedure  with  others  currently  in  use. 
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Abstract 

Alternating  chain  procedures  can  be  thought  of  as  generalizations  of  the  greedy 
algorithm  in  that  instead  of  accepting  the  best  remaining  element,  they  seek  to 
obtain  a  better  augmentation  by  examining  a  wider  range  of  alternatives.  It  is 
possible  to  generalize  the  notion  of  an  augmenting  sequence  to  include  augmentations 
which  are  in  effect  trees  as  opposed  to  simply  paths  such  that  these  augmentations 
are  sufficient  to  guarantee  optimality.  Unfortunately,  in  the  worst  case,  these 
trees  are  of  exponential  size.  We  examine  the  application  of  such  generalized 
augmenting  sequences  to  the  solution  of  NP-complete  problems  and  examine  their 
effectiveness  and  efficiency. 

Introduction 

The  theory  of  NP-completeness,  which  was  first  expounded  by  Cook  [1],  has  led 
to  a  search  for  a  unified  treatment  of  combinatorial  optimization  problems.  Cook 
was  able  to  characterize  a  very  large  class  of  interesting  and  important  problems 
as  being  equivalent  in  the  sense  that  an  efficient  algorithm  capable  of  finding  an 
optimal  solution  to  any  one  of  these  problems  can  be  used  to  obtain  optimal  solu¬ 
tions  to  all  of  the  others.  Many  papers  by  many  authors  and  an  excellent  com¬ 
pendium  (2]  of  problems  in  this  class  (as  well  as  techniques  for  proving  that  a 
problem  is  in  this  class)  have  been  published  since  Cook's  seminal  paper.  Prob¬ 
lems  in  this  class  are  called  NP-complete  problems  (or,  more  properly,  NP-hard  when 
they  are  optimization  problems  as  opposed  to  decision  problems). 

Cook's  results  can  be  interpreted  in  several  ways.  One  of  these  is  to  say  that 
many  clever  people  have  spent  many  years  trying  and  failing  to  find  efficient 
algorithms  for  individual  problems  in  this  class.  Surely  one  of  them  would  have 
succeeded  if,  in  fact,  such  algorithms  existed.  Hence,  it  is  unlikely  that  such  an 
algorithm  will  be  found  and  it  is  tempting  to  stop  looking  for  one.  This  leads  to 
the  development  of  heuristics  for  the  solution  of  such  problems  |3]  and  to  proba¬ 
bilistic  methods  [4], 

An  alternate  interpretation  is  that  this  pessimistic  view  is  justified  only  with  re¬ 
spect  to  algorithms  which  guarantee  optimal  solutions  and  reasonable  runtimes  for 
all  instances  (input  data  sets)  of  a  problem.  In  this  paper  we  speak  of  an  algo¬ 
rithm's  runtime  being  reasonable  if  it  grows  polynomially  rather  than  exponentially 
with  the  size  of  the  problem.  This  does  not  preclude  the  existence  of  algorithms 
with  guaranteed  reasonable  runtimes  and  which  yield  optimal  (or  near-optimal) 
solutions  with  high  probability.  Nor  does  it  preclude  the  existence  of  algorithms 


which  guarantee  optimal  solutions  and  which  have  reasonable  runtimes  with  high 
probability.  There  are  many  examples  of  both  types  of  algorithms  which  are  used 
in  practice  to  solve  specific  NP-complete  problems.  Most  important,  the  theory  of 
NP-completeness  does  not  preclude  or  even  lessen  the  likelihood  of  the  existence  of 
algorithms  which  solve  specific  (nontrivial)  instances  of  a  problem  and  guarantee 
both  an  optimal  solution  and  reasonable  runtime. 

In  this  paper,  we  explore  this  second,  more  optimistic,  point  of  view  and  present 
a  family  of  algorithms  for  the  solution  of  an  NP-complete  problem.  Some  algorithms 
in  this  family  have  guaranteed  reasonable  runtimes.  Others  guarantee  optimal 
solutions.  While  the  algorithms  are  presented  for  the  solution  of  a  specific  prob¬ 
lem,  the  technique  can  be  extended  to  the  solution  of  other  problems  as  well. 


Matroid  Theory 


A  specific  way  of  approaching  the  solution  of  many  combinatorial  optimization 
problems  is  via  matroid  theory.  The  excellent  book  by  Lawler  [5]  gives  a  complete 
treatment  of  this.  Here  we  outline  the  fundamentals  of  this  theory  which  are 
necessary  f«r  the  presentation  which  follows. 


A  matroid  is  a  couple  (E.F)  where  E  is  a  finite  set  of  m  elements: 


E  -  {ej  |  j  =  1.2,  .  .  .  M) 

and  F  is  a  family  of  independent  subsets  of  E.  The  notion  of  independence  is 
quite  general.  We  require,  however,  that  it  satisfy  two  properties: 

PI:  Every  subset  of  an  independent  set  is  independent,  i.e.,  if 


I  e  F  and  ]  Q  I  then  J  e  F 


P2:  If  Ip  and  Ip+^  are  independent  subsets  of  E  containing  P  and  P  ♦  1 
elements,  respectively,  then  there  exists  an  element,  e  t  Ip+1  (e  i  Ip) 
such  that  Ip  '■-'(e]  is  an  independent  set  containing  P  ♦  1  elements. 

Given  two  matroids,  (E.F!)  and  (E,F2),  defined  on  the  same  set  of  elements,  but 

using  two  different  notions  of  independence,  we  define  an  intersection  of  them  to 
be  any  subset,  1(2  E,  such  that  I  t  Ft,  and  1  t  F2.  This  definition  can  be  ex¬ 
tended  to  cover  three  or  more  matroids  as  well. 

Many  combinatorial  optimization  problems  can  be  thought  of  as  finding  the  best 
independent  set  in  a  matroid  or  the  best  intersection  of  two  or  more  matroids.  If 
weights,  w.,  are  associated  with  the  elements,  e.,  in  E,  then  one  can  speak  of  the 

best  set  as  being  the  one  with  largest  total  weight.  The  maximal  (or  minimal) 

spanning  tree  problem  can  be  thought  of  as  finding  the  maximum  (or  minimum) 
weight  independent  set  in  a  matroid  (F,F)  where  E  is  the  set  of  edges  in  the 
graph  and  F  is  the  family  of  forests.  A  forest  is  defined  to  be  a  set  ol  0  or  more 
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edges  which  do  not  contain  a  circuit.  As  another  example,  I.awler  |5,  p.  304) 
shows  that  the  Traveling  Salesman  Problem  can  be  thought  of  as  finding  the  best 
intersection  of  three  matroids.  The  problem  of  finding  the  maximum  weight  inter¬ 
section  of  three  matroids  has  been  shown  to  be  NP-complete  |2).  Lawler  shows  [5, 
p.  364]  that  the  problem  of  finding  intersections  of  four  or  more  matroids  can  be 
reduced  to  that  of  finding  intersections  of  three.  There  are  many  other  combina¬ 
torial  optimization  problems  which  can  be  naturally  thought  of  as  matroid  intersec¬ 
tion  problems.  The  theory  of  NP-completeness  assures  us  that  all  problems  can  be 
thought  if  in  this  way. 

We  will  consider  one  of  the  simplest  possible  3-Matroid  Intersection  Problems  in  the 
sequel  for  the  sake  of  clarity.  The  problem  considered  is  the  Three  Dimensional 
Assignment  Problem  (TDAP).  In  this  problem,  we  are  given  N  people,  N  jobs, 
and  N  days.  There  is  a  cost,  C-jk  of  having  person  i  doing  job  j  on  day  k.  Each 
person  is  to  do  only  one  job,  each  job  is  to  be  done  only  once,  and  only  one  job 
is  to  be  done  on  a  day.  Formally  the  problem  is: 


Minimize  Z  =  i  C...  X... 

i,j,k  1Jlt 


such  that 


V  -  Xilk  -  Xijk  =  1  for  i,j,k  =  1.2.  -■ 


.  N  Xjj|t  e  {0,1} 


Thus,  setting  X^  to  1  corresponds  to  having  person  i  do  job  j  on  day  k.  This 
problem  can  be  viewed  as  an  intersection  of  three  partition  matroids.  Given  a  set 
of  elements,  E  (in  this  case,  the  X^),  a  partition  matroid  can  be  defined  by  a 
partition  of  E  and  a  vector,  A,  constraining  the  number  of  elements  of  E  which 
may  be  selected  from  any  part  of  the  partition.  Formally,  we  have  the  partition  of 
E  into  subsets  Ej,  j  =  1,  ...  k,  where 

U  Ej  =  E  and  Ej  e.  =  4  for  i  /  j 

i 

and  an  integer  vector  A  *  {a^l  j  =  1,  ...  k} 

A  matroid  (E,F)  is  then  defined  where  F  consists  of  all  subsets,  1,  of  E  formed  by 
selecting  no  more  than  a^  elements  of  Ej. 

In  the  case  of  the  TDAP,  the  first  partition  of  the  Xjjk  is  by  person,  i . e . , 

Ei  *  <xijk,j  =  •  •  N;  k  =  1.  Ni 

and  a ■  =  1  for  all  i.  The  independent  sets  in  this  first  matroid  correspond  to 
assigning  each  person  at  most  one  job.  Similarly,  two  more  matroids  can  be  de¬ 
fined  to  constrain  jobs  and  days.  Intersections  of  these  three  matroids  correspond 
to  feasible  partial  assignments  and  intersections  of  maximum  cardinality  correspond 
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to  feasible  complete  assignments.  If  we  define  weights  W..^  associated  with  the 
Xijk; 


W 


ijk 


C  *  Cijk 


where  C  is  larger  than  any  C^,  then  the  maximum  weight  intersection  corre¬ 
sponds  to  the  optimal  solution  to  the  TDAP. 


Augmenting  Paths 

We  now  define  a  family  of  algorithms  for  the  solution  of  matroid  intersection  prob¬ 
lems.  These  are  generalizations  of  the  basic  procedure  given  in  [6J. 

Given  a  matroid  (E,F)  (and  hence  a  notion  of  independence)  and  a  (not  necessarily 
independent)  subset  S,  of  E,  we  define  the  span  of  S,  denoted  sp(S),  as  S  to¬ 
gether  with  all  elements  of  £  not  independent  of  the  elements  in  S,  that  is 


sp(S)  *  {  e  |  I  v/{e}  i  F  where  1  is  any  independent  subset  of  S) 

If  S  is  an  independent  set  and  e  c  sp(S)  then  e  forms  a  unique  cycle,  which  we 
denote  by  C(e),  with  S.  A  cycle  is  a  dependent  set  which  becomes  independent  if 
any  element  is  removed  from  it. 

If  the  matroid  intersection  problem  only  involves  two  matroids,  we  can  obtain 

a  maximum  weight  intersection  by  producing  a  sequence  cf  intersections,  P  ' , 
containing  K  elements,  for  K  =  1,2,  ...  m.  Each  is  the  maximum  weight 

(IQ 

intersection  containing  K  elements.  The  algorithm  which  produces  the  I  is 
called  an  augmenting  path  procedure  because  it  augments  P  '  to  produce  I'  ’ 
by  finding  the  longest  path  in  the  graph  G^K^  defined  below. 

We  define  Gv  to  be  a  bipartite  graph  with  nodes  corresponding  to  the  elements, 
Cj,  of  E  plus  distinguished  start  and  finish  nodes,  a  and  z.  Directed  arcs  are 
defined  as  follows. 


<a,i) 

i  e  E  -  sp,(I(K)) 

(U> 

i  c  E  -  I(K\  j  e  C(2)(i) 

(1.2) 

i  e  E  -  sp,(I(K)) 

(j.i) 

i  c  E  -  1(K),  j  e  C(1)(i) 

Paths  from  a  to  z  correspond  to  augmentations  of  that  is,  to  sets  of  elements 

/  \ 

to  be  added  or  deleted  from  I'  '  to  produce  an  intersection  with  K  ♦  1  elements. 

(IQ 

Notice  that  all  a  to  z  paths  go  alternately  through  nodes  not  contained  in  P  ' 
(which  are  to  be  added  to  and  nodes  in  I^X^  (which  are  to  be  deleted). 

Note  also  that  there  is  one  more  node  of  the  former  type  than  there  is  cf  the 
latter  and  hence  an  augmentation  results.  If  we  associate  lengths  with  the  arcs 
equal  to  the  weights  of  the  elements  which  the  nodes  correspond  to  (positive  for 
elements  to  be  added  and  negative  for  elements  to  be  deleted),  then  the  length  of 
a  path  corresponds  to  the  incremental  weight  of  the  augmentation.  The  longest 
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path  results  in  an  optimal  augmentation.  Such  a  path  can  be  found  using  a  short- 

00 

est  path  algorithm  suitably  modified  to  find  longest  paths.  G  contains  no 
positive  cycles  and  so  the  algorithm  converges. 

These  augmentations  do,  indeed,  result  in  intersections.  As  one  passes  through 
nodes  from  a  to  2  we  see  that  an  element  is  added  preserving  independence  in  the 
first  matroid  but  not  the  second.  An  element  is  then  deleted  restoring  indepen¬ 
dence  in  the  second  matroid  and  hence  the  intersection.  A  node  is  then  added 
which,  because  of  the  deleted  node,  maintains  independence  in  the  first  matroid. 
This  process  continues  until  the  added  element  maintains  independence  in  the 
second  matroid  as  well  as  the  first,  thus  completing  the  augmentation. 

As  an  example,  consider  a  two  dimensional  assignment  problem  (involving,  say, 
only  people  and  jobs.)  The  WV's  for  this  problem  are  given  in  Figure  1.  is 

clearly  11,22,  i.e.,  person  1  assigned  to  job  1,  and  person  2  assigned  to  ^  ^ 

is  shown  in  Figure  2.  The  arc  lengths  are  shown  as  are  the  lengths  of  the 

longest  paths  to  each  node  from  node  a.  The  longest  a  to  2  path  is  a,  11,12, 22, 23, 2 

which  corresponds  to  deleting  11  and  22  from  the  intersection  and  adding  31,12, 

(3) 

and  23.  The  length  of  this  path,  7,  is  the  difference  between  the  weight  of  I'  ' 
and  1^.  A  complete  description  of  this  process  and  a  proof  of  its  validity  is 
given  in  (5J. 

Generated  Auqumenting  Paths 

In  the  graph  shown  in  Figure  2,  one  can  obtain  an  optimal  augmentation  (i.e.,  one 
which  takes  us  from  an  optima!  assignment  of  K  elements  to  an  optimal  assignment 
of  K  *  1)  because: 

1.  If  the  current  intersection  is  not  maximal  then  an  augmenting  path  exists. 

2.  The  labels  given  to  the  nodes  during  the  longest  path  algorithm  completely 
summari2e  the  augmenting  paths. 

We  now  wish  to  generalise  the  notion  of  an  augmenting  path,  and  hence  the  entire 
procedure,  to  the  problem  of  the  intersection  of  three  matroids.  One  way  of  doing 
this  is  to  "freeze"  one  of  the  matroids  and  only  consider  alternating  sequences 
within  the  other  two.  In  this  case  the  first  node,  st  in  an  augmenting  path  would 
be  independent  of  in  two  of  the  three  matroids  (or  in  all  three,  in  which  case 

it  is  the  only  node  in  the  augmenting  path).  Say  s,  is  independent  of  1^*^  in  the 
first  and  third  matroids.  We  could  then  freeze  the  third  matroid  and  maintain  the 
same  span  within  the  third  matroid  throughout  the  augmenting  path.  Thus,  the 
deletion  of  for  i  even  reduces  this  span  and  the  addition  of  s)  for  i  odd  restores 
it.  We  thus  reduce  the  search  space  to  two  matroids  and  the  same  polynomial 
bounded  procedure  will  work.  Note  that,  alternatively,  we  could  have  considered 
the  first  matroid  frozen.  Indeed,  it  is  so  frozen  in  the  two  matroid  intersection 
algorithm  Thus,  there  are  three  types  of  augmenting  paths,  one  for  each  matroid 
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within  which  st  is  dependent.  Unfortunately,  while  this  procedure  is  polynomial 
bounded,  it  does  not  guarantee  optimal  solutions  as  there  are  augmentations  which 
have  no  such  corresponding  argumenting  path. 

In  order  to  guarantee  that  all  augmentations  are  explored,  we  must  relax  the 
definition  of  an  augmenting  path  still  further  to  include  cases  where  independence 
is  not  necessarily  restored  by  the  deletion  of  s.  for  i  even.  Thus,  an  augmenting 
path  may  start  with  any  element,  s,,  which  is  independent  of  in  at  least  one 
of  the  matroids.  Unlike  the  procedure  given  for  two  matroids,  one  may  begin  with 
independence  in  any  matroid.  Consider  the  graph  shown  in  Figure  3  corresponding 
to  two  augmenting  paths,  Path  1  and  Path  2,  for  the  partial  assignment  111,222,333 
(i.e.,  person  1  to  job  1  on  day  1,  etc.)  in  a  TDAP.  These  paths  arc  not  strictly 
comparable  in  that  they  exclude  different  elements  along  the  way.  Thus  in  Figure 
2,  when  node  22  is  labeled  using  the  path  a, 32, 22  it  is  equivalent  (in  terms  of  how 
the  path  can  continue,  not  necessarily  in  terms  of  the  numerical  value  of  the  label) 
to  being  labeled  using  the  path  a, 31, 11, 12, 22.  In  Figure  3,  however,  when  node 
111  is  labeled  using  the  path  a, 411, 111  it  is  different  from  labeling  111  using  the 
path  a, 154, 111  because  different  continuations  of  these  paths  are  possible.  Thus 
starting  with  a, 411, 111  we  can  continue  to  152  but  not  215  and,  conversely,  start¬ 
ing  with  a, 154, 111  we  can  continue  with  215  but  not  152.  Thus,  Path  1  and  Path 

2  are  not  comparable  in  terms  of  their  lengths  only. 

Such  paths  must  also  be  compared  in  terms  of  their  spans.  We  note  that  if  two 
paths  from  a  to  some  node  i  result  in  sets  having  identical  spans  then  the  same 
continuations  of  both  paths  are  possible.  (This  was  the  case  for  intersections  of 
two  matroids.)  Indeed,  it  is  possible  for  paths  to  have  slightly  different  spans 
and  still  have  the  same  set  of  possible  continuations.  In  particular,  if  the  only 
difference  in  the  intersections  of  the  spans  of  two  paths  are  nodes  outside  the 
intersection  of  the  spans  of  then  the  paths  are  comparable.  We  can  thus 

generalize  the  augmenting  path  procedure  to  consider  all  undominated  a  to  ?.  paths 
where  one  path  dominates  another  only  if  it  has  the  same  continuations  and  a 
larger  length. 

The  notion  of  a  path  itself,  however,  must  be  generalized  as  well.  In  the  case  of 

3  matroids,  not  all  augmentations  correspond  to  paths.  We  see  an  example  of  this 
for  a  TDAP.  The  augmentation  (412,234,341,123)  -  (111,222,333)  does  not  cor¬ 
respond  to  any  path  in  the  conventional  sense.  It  is  possible  however,  to  extend 
the  augmenting  path  procedure  to  include  such  augmentations  by  extending  the 
notion  of  a  path. 

We  define  a  generalized  augmenting  path  with  respect  to  an  intersection  1^^  to  be 

a  sequence  of  nodes  S  =  (si,s2,  ...  s  )  where  s  c  E  -  1^^  for  odd  i  and  s  c 

(K)  ( K  i  m  1  1 

I  for  even  i.  As  before,  I  '  ♦  s.  -  s2  ♦  s3  -  . . .  ♦  s_  is  an  intersection.  Also, 

the  even  Sj  are  deleted  in  order  to  remove  dependencies  created  by  the  inclusion 
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of  the  odd  s..  Now,  however,  the  subsequences  1^^  •  s(  •  Sj  *  '  ”Sj  ^or  ®ven  1 

need  not  correspond  to  intersections. 

One  can  thus  guarantee  an  optimal  intersection  as  in  the  case  of  two  matroids.  The 
number  of  generalized  augmenting  paths  one  may  need  to  consider,  however,  may 
grow  expbnentially  with  K.  In  practice,  however,  the  number  of  such  paths  can 
be  controlled  at  the  expense  of  optimality.  First,  the  length  of  any  path,  ( s , ,  s2, 

....  «j),  can  be  reduced  by  a  penalty  to  account  for  the  nodes  which  still  must  be 

deleted  to  restore  the  intersection.  In  the  case  of  arbitrary  matroids,  this  may  be 
complex  to  compute.  In  the  case  of  the  TDAP,  however,  where  3  partition 

matroids  are  involved,  and  all  cycles  contain  2  elements,  it  is  easily  computed. 

In  some  cases  the  above  may  keep  the  computations  reasonable.  In  others,  it  may 
be  necessary  to  reduce  the  number  of  paths  considered  by  relaxing  the  definition 
of  dominance.  This  will  also  result  in  a  heuristic  rather  than  an  optimal  solution. 
In  the  case  of  the  TDAP,  one  such  relaxation  is  to  ignore  differences  in  the  spans 
outside  the  intersection  of  the  span  of  I^K\  This  is  motivated  by  the  fact  that 
we  consider  deleting  elements  in  1^  in  order  to  include  elements  blocked  by 
them. 

We  can  thus  consider  a  hierarchy  of  generalized  augmenting  path  procedures  with 
increasingly  stringent  dominance  criteria  and  increasing  runtime.  A  tradeoff 
between  optimality  and  runtime  is  then  available.  We  are  currently  investigating 
this  tradeoff  using  the  TDAP  as  an  example. 
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Two  shortest  path  algorithms  are  compared  and  it  is  shown  that,  while  one  outper¬ 
forms  the  other  in  practice,  the  former's  running  time  is  exponential  in  the  worst  case 
while  the  latter's  is  polynomial.  A  procedure  which  constructs  such  worst  case  ex¬ 
amples  is  given 


In  the  excellent  paper  “A  Computational  Analysis  of  Alternate  Algorithms  and 
Labeling  Techniques  for  Finding  Shortest  Path  Trees”  by  Dial.  Glover,  ICarney,  and 
Klingman  |1]  the  statement  is  made  “that  the  label-setting  algorithm  has  a  worst  case 
bound,  that  is  an  order  of  magnitude  better  than  the  label-correcting  algorithm  "  It  is. 
in  fact,  easily  shown  that  this  statement  is  true  for  the  performance  of  Algorithm  Cl 
relative  to  the  performance  of  a  label  setting  method.  In  Algorithm  Cl,  since  the 
>  sequence  list  is  managed  in  a  FIFO  fashion,  paths  are  generated  in  order  of  the  number 

of  arcs  in  them.  Each  node  can  thus  be  scanned  at  most  A'  times  (creating  paths  with 
K  arcs,  K  =  1, .  .  ,  N),  where  N  is  the  number  of  nodes.  Since  nodes  are  scanned  only 
once  in  label  setting  algorithms,  the  result  follows. 

The  situation  is  surprisingly  different  with  respect  to  Algorithm  C2.  I  have  used  this 
algorithm  to  find  routes  in  very  large,  very  sparse  real  networks  (thousands  of  nodes 
and  average  nodal  degree  between  2  and  3)  with  a  variety  of  length  functions  (gen¬ 
erally  distance-related)  and  have  found  it  to  outperform  all  others  This  is  exactly 
as  the  authors  indicated.  The  worst  case  behavior  of  this  algorithm,  however,  is 
exponential! 

Consider,  for  example,  the  network  in  Figure  1  Suppose  that  the  list  of  nodes  adja¬ 
cent  to  node  1  (the  root)  is  in  ascending  order  of  node  number  and  that  the  list  of 
nodes  adjacent  to  all  other  nodes  is  in  descending  otdei  We  find  that  every  time  a 
node  is  scanned,  node  2  is  relabeled  and  scanned.  In  fact,  node  2  takes  all  labels  be¬ 
tween  20  and  5  and  is  scanned  every  time  it  is  relabeled.  Here  V  =  t>  and  node  2  is 
scanned  2A':  time:,  Netwoiks  with  tho  characteristic  and  of  arbitrary  sire  can  be 
formed  as  follows 

Step  I  Start  with  a  network  with  two  nodes.  I  and  2.  and  two  arcs  (1,2)  and 
(2.1)  with  /. (  1 .  21  -  I  and  U2.  1  )  -  I .  where  /, (i.  /)  is  the  length  of  the 
uic  I  min  i  In; 
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Step  2:  For  K  =  3  to  TV  alter  the  network  as  tollows: 

(a)  Add  node  K 

(b)  Add  arcs  (K.  i)  for  i  =  2 . K -  1  with  UK.  i )  =  1(1,  /'). 

(c)  Set L(l. t)  =L(1. t)  +  +  1 

(d)  Add  an  arc  ( 1 .  K )  with  L  ( I  K )  =  1 

During  the  execution  of  algorithm  C2.  the  node  2  takes  all  labels  from  A'-  1  to 
2n'2  +  N -  2  and  is  scanned  2/v’3  times. 

It  should  be  noted  that  this  pathological  situation  is  a  product  not  only  of  the  un¬ 
usual  arc  lengths  but  also  of  the  ordering  of  the  adjacency  lists,  in  particular  that  lists 
of  adjacent  nodes  are  ordered  with  the  nearest  node  first  for  all  nodes  except  the  root 
If  the  adjacency  lists  were  ordered  oppositely,  each  node  would  be  scanned  only  once 
Thus,  we  see  a  strong  dependence  of  the  performance  of  Algorithm  C2  on  a  preorder 
ing  of  the  links.  The  general  idea  would  be  to  order  links  in  such  a  way  that  they 
would  be  likely  to  be  scanned  in  ascending  order  of  then  labels.  Alternatively,  adja 
cency  lists  could  be  ordered  nearest  node  first  and  the  sequence  list  management  could 
be  altered  only  slightly  to  add  the  entire  1  ist  of  relabeled  nodes  to  top  of  the  sequence 
list  in  forward  order  (rather  than  the  reverse  order  implied  in  Algorithm  C 2 >  ot  the 
adjacency  list  of  the  node  being  scanned. 

In  summary,  we  note  that  Algorithm  C 2  has  excellent  behavior  in  practice  and  pre¬ 
processing  is  probably  warranted  only  il  multiple  applications  of  the  algorithm  are 
being  done  (e  g.,  from  multiple  sources) 
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111).  Probabalistic  Analysis  of  Algorithm  Performance 

The  problem  of  finding  an  optimal  set  of  repeater  locations 
covering  a  given  set  of  terminal  sites,  or  set  of  potential  repeater 
locations  and  a  covering  matrix  specifying  which  terminal  sites  can  be 
covered  by  each  repeater  location,  is  an  important  problem  in  the 
design  of  multihop  packet  radio  networks.  The  problem  in  an  instance 
of  the  classic  set  covering  problem  which  has  many  other  applications 
as  well.  The  set  covering  problem  is  known  to  be  NP-completo  and 
as  such  it  is  unlikely  that  an  algorithm  will  be  found  which  can 
guarantee  an  optimal  solution  and  also  guarantee  a  reasonable  running 
time,  that  is  a  runtime  which  grows  polynomially  in  the  number  of 
terminals  and  repeaters. 

The  theory  of  nondeterministic  polynomial  completeness  (NP- 
completeness)  basically  states  that  there  is  a  large  class  of  problems, 
which  includes  almost  all  optimization  problems  of  interest  in  the  area 
ot  network  design,  which  have  the  property  that  a  solution  to  one 
could  be  used  to  obtain  a  solution  to  all  the  others  in  a  reasonable 
amount  of  time.  It  has  been  shown  that  one  problem,  the  Satisfiability 
Problem,  has  the  property  that  all  problems  in  this  class  can  be 
transformed  into  an  instance  of  it.  Thus,  an  algorithm  which  could 
obtain  an  optimal  solution  to  the  Satisfiability  Problem,  could  be  used 
to  solve  all  the  others.  Techniques  have  been  developed  for  proving 
that  many  other  problems  are  NP-complete.  Proofs  have  been  given 
for  the  N P-completeness  of  over  200  other  well-known  problems. 

In  our  previous  work  we  reported  on  results  obtained  using  a 
new  set  covering  algorithm.  The  algorithm  obtained  optimal  solutions 
tor  moderate  sized  problems  with  up  to  .100  node  networks  and  cover- 


age  related  to  distance.  That  is,  if  the  average  matrix  reflected  the 
fact  that  a  repeater  was  far  more  likely  to  cover  a  nearby  terminal 
than  one  farther  away,  then  the  algorithm  worked  very  well,  con¬ 
verging  to  an  optimal  solution  very  guickly. 

If  on  the  other  hand,  the  coverage  matrix  was  random,  the 
algorithm  ran  much  more  slowly  and  only  problems  with  up  to  50 
terminals  could  be  treated  efficiently.  In  examining  why  the 
algorithm  had  difficulty  treating  problems  with  random  data,  we 
observed  that  the  difficulty  stemmed  from  the  large  number  of  optimal 
and  near  optimal  solutions  available.  The  algorithm  had  no  difficulty 
finding  an  optimal  solution  but,  rather,  had  difficulty  in  verifying  the 
solution  was  optimal  in  a  reasonable  amount  of  time  due  to  the  presence 
of  a  large  number  of  alternate  solutions  of  comparable  guality.  This 
led  us  to  conjecture  that  it  might  be  possible  to  develop  algorithms 
which  had  a  reasonable  running  time  and  which  could  find  optimal 
time  or  near-optimal  solutions  with  high  probability.  It  io  known  that 
asymptotically,  as  the  size  of  the  problem  becomes  infinite,  there 
exists  algorithms  which  give  optimal  results  almost  always.  These 
results  are  an  outgrowth  of  the  same  symmetry  which  makes  it  difficult 
to  verify  the  optimality  of  the  solutions  produced  by  our  set  covering 
algorithm. 

These  results  are  encouraging  but  of  no  direct  use  in  solving 
network  design  problems  since  real  problems  are  of  finite  size  and 
nothing  was  said  about  how  fast  the  algorithms  converge  probablis- 
tically  to  an  optimum;  i.e.,  what  the  probability  is  of  their  finding 
the  optimum,  or  a  solution  within  some  bound  of  the  optimum.  We 
thus  sought  to  investigate  what  could  be  said  about  the  probabalistic 


performance  ol  known  heuristics  tor  problems  ot  moderate  si/e.  Wi 
chose  the  set  covering  problem  mentioned  above  as  the  first  problem 
to  investigate.  The  probabalist ic  mode!  ot  this  problem  follows. 


We  are  given  a  set  1 

“Hi,  t2 , 

...  tNs 

of  \  terminals 

,  a  set 

R  =  {r1(  r  L, ,  ...  rM}  of  M 

potential 

repeater 

sites,  and 

an 

\  X  M 

covering  matrix,  C,  where 

(:..  is  1 

i) 

if  t.  can 

i 

be  covered 

by 

r..  We 

I 

seek  a  subset  of  R  containing  as  few  repeaters  us  possible  and  cover¬ 
ing  all  terminals.  The  elements  of  C  are  chosen  randomly  and  inde¬ 
pendently.  Specifically,  we  assume  F5rS c. .=1  }  =  p  and  Rr { c b j=(> f  =  1  -p=q  . 


We  define  R.,  the  set  of  terminals  covered  by  r,  by: 

Kj  =  ^i'cij  =  ^ 

Similarly,  we  define  Tj,  the  set  of  repeaters  covering  t,  by: 

rT’ .  =  { r .  |  c . .  =  1  } 

A  cover  of  the  terminals  is  then  defined  as  a  subset  SCR  satisfying: 

U  R.  =  T 

r .  t:  S  ^ 

J 

A  minimum  cover,  S*,  is  then  a  cover  containing  as  few  elements  as 
possible ,  i .  e . : 

U  R.  =  T 

r .  r.  S  * 

1 

and  |S*|  <  |S|  for  all  S  such  that  U  R.  =  T. 

r.eS  * 

) 

Note  that  the  minimal  cover  is  not  in  general  unique.  Indeed 
our  approach  rests  on  the  existence  of  a  large  number  of  minimal  and 
near  minimal  covers. 

In  order  to  analyze  the  probabulistic  behavior  ot  a  heuristic  we 
must  do  two  things.  F  irst,  for  a  given  set  of  values  ot  N ,  M,  and 
p,  we  must,  find  the  probability  that  |S*|  =  K,  that  is,  the  probability 
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that  there  exists  a  minimal  covering  containing  exactly  K  repeaters 
Let 


P(N,M,K)  =  Pr{|S*|  =  K  S  for  given  N,M,  and  p. 

\ext  we  must  tind  P .  .  .,(  N  ,  M  ,  K  ) ,  defined  as  the  probabilitv  that  the 

A  l  jU 

algorithm  will  find  a  solution  with  exactly  K  repeaters.  A  Injure  <d 
merit  for  evaluating  the  probabilistic  behavior-  ol  the  algorithm  is 
then 


1  A  LG 


M  M 

1  K  P  (  N  ,  M ,  K  )  -  1  KP(N,M,K) 

:=o  ALCj  «=o 


M 

I  KP(N,M,K) 
K=0 


(  1 1  Id .  1  ) 


1  his  is  simply  the  relative  error  made  by  the  algorithm,  i.e., 
the  difference  between  the  avarage  number  of  repeaters  in  a  covering 
found  by  the  algorithm  and  the  minimum  number  required,  normalized 
by  the  minimum  number  required.  We  believe  that  ^m.  -  0  for 

any  M,  and  p,  and  any  reasonable  algorithm  where  a  reasonable 
algorithm  is  defined  as  one  which  stops  when  it  has  a  cover.  In 
particular,  we  believe  f°r  an  al9°r'thm  which  picks  repeaters 

randomly  until  a  covering  is  obtained. 

We  begin  by  evaluating  Q(N,M,K),  the  probability  that  a  ran¬ 
domly  chosen  set  of  K  repeaters  covers  all  N  terminals.  Notice  that 
this  is  not  the  same  as  the  probability  that  an  algorithm  picking 
repeaters  at  random  will  stop  after  K  repeaters  since  in  some  cases 
fewer  than  K  will  also  suffice  and  the  algorithm  will  tind  them.  Also, 
Q(N',M,K>  is  a  cumulative  distribution,  that  is  it  relates  to  the  proba¬ 
bility  th.it  K  or  fewer  repeaters  are  required  for  a  covering  lather 
than  exactly  K.  We  do  have,  however,  that 
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g(\,M,K  )  V  k  ft  N  ,M,K  )  (  III).  J  ) 

l.=U 

th.it  is,  the  probability  ot  a  randomly  chosen  set  of  k  repealers 
coveting  all  \  terminals  is  less  than  or  equal  to  the  probability  ol  the 
existence  of  a  covering  containincj  K  or  fewer  terminals.  Thus  we 
have , 

F(N,M,K)  >  g(N,M,k)  -  Q(N,M,K-1),  K'l, 
that  is,  the  difference  between  two  successive  g's  is  a  lower  bound 
on  the  corresponding  IT 

g(N,M,K)  is  easy  to  evaluate  because  repeaters  picked  randomly 

are  picked  independently  of  one  another  and  our  random  model 

assumed  their  original  characteristics  are  independent.  The 

probability  of  a  single  terminal  not  being  covered  by  a  single  repeater 

is  q(  =  l-p).  The  probability  ol  a  single  terminal  not  being  covered  by 

any  of  the  K  repeaters  is  q  .  The  probability  of  at  least,  one  repeater 

of  K  covering  a  given  terminal  is  then  1-q  .  finally,  the  probability 

K  N 

that  all  N  terminals  are  covered  by  the  K  repeaters  is  (1-q  )  .  We 

thus  have  that 

Q(N,M,K)  =  (1  -  qK)N  (I  ID.  3) 

Note  that  this  is  not  a  function  of  M  except  in  the  trivial  sense 
that  M  must  be  no  smaller  than  K.  figure  110. 1  shows  values  of 
g(N,M,K)  for  p=.5.  Figure  IID.2  shows  values  of  g(N,M,K)  - 
g<N,M,K-l)  for  the  same  examples.  Both  show  a  tendency  tor-  solu¬ 
tions  to  cluster  about  a  narrow  range  of  K.  This  is  encouraging  in 
that  it  implies  that  probabalist ically ,  simple  algorithms  should  do  well. 


:)cr  of  Repeaters,  K 


ID.  1  CUMULATIVE  PROBABILITY  OF  A  COVER  i.\( 
RANDOM  ALGORITHM  (p=0.5). 
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As  yet  ,  we  do  not  have  an  exact  analytic  expr  ession  ha 
l'i\,M,K).  Ur  can  yet  a  tighter  lower  bound  on  d  by  evaluating  tin 
distribution  of  solutions  Irom  increasingly  sophist  icated  algorithm:  .  A 
simple  minded  algorithm  would  be  to  pick  repeaters  randomly  until  one 
obtained  a  coveriny.  I  he  probability  ot  obtaining  a  solution  with  k 
or  h'wer  repeaters  usiny  this  alyorithm  is  P,i\,M,k)  yiven  by: 


P ,  (  \  ,  M  ,  K  ) 


N 

z 

i=() 


M 

i 


i  N- 
P  q 


P ,  (  N-i ,  M-l  , k-  I  ) 


(  III).  1  i 


since  the  k1^1  repeater  will  cover  i  terminals  with  probability  (^)p!y  '  1 
and  the  remaininy  k-l  of  M-l  repeater  must  cover  the  remnininy  N-i 
terminals.  Note  that  this  is,  ayain,  not  <i  (unction  ot  M  except  that 
M  must  be  no  smaller  than  k.  The  probability  ol  obt.iininy  a  coveriny 
wit  h  exactly  k  r  epeaters  is  yiven  by  P ,  (  N  ,M ,  k  )-P |  (  N  , M ,  k- 1  ) . 

A  somewhat  more  sophisticated  algorithm  is  to  consider  repeaters 
in  <i  random  order  and  to  select  a  repeater  only  it  it.  improves  the 
chances  ot  obtaining  a  covering  more  by  (ricking  it  than  by  not 
(ricking  it.  betting  P2(N,M,K)  be  the  probability  erf  obtaining  a 
covering  containing  k  cr  fewer  repeaters  using  this  algorithm,  we 
have 


p.,(,\,M,k)  -  2  N  p‘(]N"l!max  <!>.,(  N-i,  M-l, k-l),  p..<  N  ,  M- I  ,  k  )  )  | 
i -0  1 

(III),.) 

l’L.(\,M,k)  is  plotted  for  N=.rr  and  N-IU  in  Pigure  IID.d.  Along  wit h 
it,  the  probability  ol  obtaining  a  cover- any  using  k  randomly  chosen 
repeaters  is  plotted.  Notice  that  this  algorithm's  performance  is  a 
substantial  improvement  over  randomly  [licking  repealers.  Notnc 
also,  that  p.,(\,M,k>  is  a  lunclion  of  M.  This  algorithm  is  vei  v 
simple  and  has  a  r  easonable  running  lime  even  lor  very  laige  \  md 
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M.  Our  next  objective  is  to  compare  il  with  an  upper  bound  mi 
i'(\,M,K)  to  see  if  it  already  has  converged  to  near  optimal 
pertormance  probabalistically . 


CL _ PROBABILISTIC  ANALYSIS  OF  ALGORITHMS 

We  have  obtained  results  in  this  area  indicating  that  even  rela¬ 
tively  simple  algorithms  will,  with  high  probability,  obtain  near  optimal 
solutions  to  certain  difficult  optimization  problems.  This  is  encouraging 
in  that  it  implies  that  problems,  previously  considered  intractable,  may 
in  fact  be  solvable,  at  least  with  hiav  expectation  of  success.  The 
results  are  also  useful  in  guiding  heuristics  towards  potentially  fruitful 
areas  within  the  solution  space  and  in  avoiding  unnecessary  et’ort  m 
areas  where  little  further  progress  can  be  expected. 
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Specifically ,  we  have  analyzed  the  intrinsic  probabilistic  behavior 
of  the  3-Satisfiability  Problem  and  found  that,  independent  ol  the  size 
of  the  problem,  there  is  only  a  limited  region  of  uncertainty,  i.e.,  only 
a  finite  set  of  values  where  even  the  simplest  sensible  algorithm  can 
make  an  error.  This  problem,  which  is  described  in  detail  below,  is 
representative  of  a  very  large  class  of  difficult  combinatorial  problems 
and  can,  in  fact,  be  used  as  a  vehicle  for  solving  many  other  problems. 
Thus,  we  believe  these  results  can  be  extended  to  other  problems  as 
well,  most  notably,  to  problems  in  network  design  such  as  facility 
location  and  link  topology  optimization.  We  are  currently  in  the  process 
of  investigating  such  extensions  and  of  examining  the  probabilistic 
performance  of  several  specific  heuristic  algorithms. 

C- 1 .  Probabilistic  Behavior  of  the  3-Satisfiability  Problem 

The  3-Satisi. ability  Problem  has  a  simple  and  homogeneous  struc¬ 
ture  which  makes  it  easy  to  develop  and  study  a  probabilistic  model.  It 
also  does  not  involve  any  numerical  data  which  would  otherwise  obscure 
the  nature  and  generality  of  the  results  which  we  obtain.  These  re¬ 
sults  can,  however,  be  directly  extended  to  optimization  problems  which 
do  involve  numerical  data,  as  we  will  see  in  the  following  discussion. 

In  the  ordinary  satisfiability  problem,  we  are  given  a  Boolean 
expression,  F, ,  over  m  Boolean  variables  v1(v2,...,v  and  we  ask  if 
there  exists  a  set  of  truth  values  for  the  variables  which  will  result  in 
E  taking  the  value  True.  Thus,  each  of  the  v^  can  take  either  the 
value  True  or  False  (alternatively  denoted  by  1  or  0)  and  F  will  then 
take  either  the  value  True  or  False  based  on  the  values  of  the  v..  F  is 

i 

usually  given  in  disjunctive  normal  form,  i.e.,  as  a  set  of  clauses  all  of 
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which  must  be  True  in  order  for  E  to  be  true.  Each  clause  contains 
one  or  more  variables  and  is  said  to  be  true  if  at  least  one  of  the 
variables  in  the  clause  is  assigned  the  value  it  takes  in  the  clause. 
Thus  for  example,  E  =  (vt  +  v2 )( v !  +  v3)  has  2  clauses.  The  first, 
(Vj  +  v2)  is  true  if  either  V;  is  True  or  v2  is  True.  The  second  is 
[rue  if  either  Vj  is  False  or  v3  is  True.  Thus  E  is  satisfiable  as  the 
values  vt  =  True,  v2  =  True,  and  v3  =  True  satisfy  both  clauses  and 
hence  E.  There  are  several  other  assignments  of  truth  values  v1(  v2 , 
and  v3  which  will  satisfy  this  E.  The  expression  (VjXvj  +  v2)(v2)  on 
the  other  hand  is  not  satisfiable. 

The  3-Satisfiability  Problem  is  a  version  of  the  ordinary  Satisfia¬ 
bility  Problem  where  all  clauses  contain  exactly  3  variables.  Garey  and 
Johnson,  in  their  book  Computers  and  Intractability,  show  this  problem 
is  NP-complete  by  showing  that  any  ordinary  Satisfiability  Problem  can 
be  transformed  into  a  corresponding  3-Satisfiability  Problem  of  roughly 
the  same  size.  Thus  the  two  problems  are  equivalent. 

Many  other  problems  can  also  be  transformed  into  corresponding 
3-Satisfiability  Problems.  In  particular,  combinatorial  optimization 
problems  such  as  the  Traveling  Salesman  Problem  or  the  problem  of 
locating  earth  stations  in  a  satellite  network  can  be  so  transformed. 
Thus,  we  can  produce  a  Boolean  expression  which  corresponds  to  a 
particular  optimization  problem  in  the  sense  that  if  the  expression  is 
satisfiable  then  the  optimization  problem  has  a  solution  with  a  value  less 
than  or  equal  to  a  given  constant  (for  minimization  problems)  or  greater 
than  or  equal  to  a  given  value  (for  maximization  problems).  Further¬ 
more,  the  satisfying  truth  assignment  can  be  used  to  obtain  the  solution 
to  the  corresponding  optimization  problem  directly.  By  altering  the 
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value  of  the  constant  in  the  above  transformation  the  optimal  solution  to 
the  corresponding  optimization  problem  can  be  found  via  binary  search; 
i.e.,  if  we  know  the  optimum  lies  between  values  Cj  and  c2,  we  try  the 
value  Cj  +  c2/2.  This  technique  is  known  as  thresholding. 

One  can  determine  if  an  expression  is  satisfiable  by  assigning  all 
2m  possible  truth  values  to  its  variables.  This  approach  is,  of  course, 
not  practical  for  large  values  of  m.  There  are  many  sophisticated  tech¬ 
niques  for  answering  the  question  whether  or  not  a  given  expression  is 
satisfiable,  but  all  have  running  times  which  ultimately  grow  exponen¬ 
tially  with  the  number  of  variables  and  thus  are  limited  to  problems  of 
modest  size. 

We  obtain  here  a  technique  which  yields  the  a  priori  probability  of 
an  expression  with  a  given  structure  being  satisfiable  given  a  probabil¬ 
istic  model  of  the  space  from  which  the  problem  is  drawn.  For  prob¬ 
lems  where  this  probability,  Pg,  is  very  close  to  1  or  very  close  to 

zero,  we  need  seek  no  further.  Only  for  problems  where  P  is  signifi- 

s 

cantly  different  from  0  or  1  need  the  question  be  investigated  any 
further.  In  some  cases,  the  Pg  itself  is  all  that  we  need.  For  exam¬ 
ple,  if  we  have  found,  using  a  heuristic  algorithm,  a  solution  of  value 
c,  to  a  given  maximization  problem,  and  have  determined  that  P  is  less 
than  .01  for  an  expression  corresponding  to  the  existence  of  a  solution 
of  value  greater  than  or  equal  to  Cj  +  a  (for  a  much  smaller  than  c,), 
then  we  are  reasonably  certain  that  there  is  not  much  to  be  gained  from 
an  attempt  at  further  optimization.  This  is  very  important  because  it 
has  often  turned  out  that  it  is  much  harder  to  verify  the  optimality  (or 
near-optimality )  of  a  solution  than  to  find  an  optimal  solution  especially 
when  the  solution  space  is  rich  and  there  exist  many  alternate  near- 
optimal  solutions. 
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We  consider  a  uniform  probabilistic  model  for  the  3-Satisfiability 
Problem  where  all  clauses  are  equally  likely.  Thus  a  problem  is  totally 
characterized  by  v  and  b,  the  number  of  variables  and  clauses,  respec¬ 
tively.  We  will  consider  two  models  which  are  nearly  equivalent.  In 
one  case,  clauses  are  picked  without  replacement,  i.e.,  a  given  clause 
can  appear  at  most  once  in  an  expression.  In  the  other  case,  clauses 
are  picked  with  replacement. 

Duplicate  clauses  do  not  affect  the  satisfiability  of  an  expression. 
We  can  thus  relate  problems  w’here  the  clauses  are  chosen  with  replace¬ 
ment  to  problems  where  the  clauses  are  chosen  without  replacement  by 
saying  that  a  problem  with  v  variables  and  b  clauses  chosen  with  re¬ 
placement  is  the  same  as  a  problem  with  v  variables  and  b '  clauses 
chosen  without  replacement  if  the  number  of  distinct  clauses  in  the  first 
problem  equals  b'.  The  relationship  between  the  number  of  clauses 
chosen  with  replacement,  b,  and  the  average  number  of  distinct  clauses 
is  given  by  the  following  argument. 

Let  X^  be  the  number  of  distinct  clauses  when  b  clauses  are 
selected  with  replacement.  Then  <  b  and  X^  =  X^j  +  a,  where 

a  -  1  with  probability  P  and  a  =  0  with  probability  1  -  P  P 

2  new  v  2  new  new 

is  the  probability  that  the  b1*"1  clause  is  different  from  all  the  first 
b  -  1 .  P  is  simply  the  ratio  of  the  number  of  unchosen  clauses  to 
the  total  number  of  possible  clauses.  The  number  of  possible  clauses  is 

B  =  8(3) 

since  each  clause  contains  3  variables  each  of  which  can  take  either  of 

two  values.  Thus  P  is  given  by: 

new  y  2 


P 

new 


B 


kb-l 


B 


(C-1  ) 
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Let  f(b)  be  the  average  number  of  distinct  clauses.  Then 

E(XJX.  i)  =  xk  i  +  ,  (G-2) 

b  b-1  b-1  new 

and 

f(b)  =  E(Xb)  =  f(b  -  1)  +  B—  B(b"~  •  ;t-3) 

Thus 

f(b)  =  f(b-l)  x  Brl+  1  .  (04) 

As  an  example,  if  v  =  10,  then  B  =  960  and  f ( 50 )  =  48.7.  As  we  will 
see,  we  are  most  interested  in  values  of  b  near  5v.  For  v  =  20  and 
b  =  100,  f  =  99.5.  As  v  increases,  f/b  approaches  1  very  quickly. 
Thus,  there  is  little  difference  between  choosing  clauses  with  or  without 
replacement. 

C.2.  Analytical  Method  1 

We  now  turn  to  the  problem  of  estimating  P  (v,  b),  the  probability 
of  expression  E,  with  v  variables  and  b  clauses,  being  satisfiable. 
Given  a  problem  with  v  variables  and  b  clauses,  we  define  the  truth 
assignment  graph,  G,  associated  with  that  problem  to  be  a  graph  with 
2V  +  1  nodes.  Nodes  1  through  2V  correspond  to  the  possible  truth 
values  for  the  v  variables  and  node  0  is  a  distinguished  pseudonode. 
Each  clause  in  E  is  inconsistent  with  one  eighth  of  the  truth  assign¬ 
ments,  specifically  those  in  which  all  three  variables  in  the  clause  have 
values  opposite  to  their  values  in  the  clause.  G  contains  an  arc  be¬ 
tween  node  0  and  any  node  corresponding  to  a  truth  assignment  which 
does  not  satisfy  E.  We  thus  proceed  through  E  clause  by  clause  and 
add  arcs  between  node  0  and  nodes  which  do  not  satisfy  the  clause. 
There  are  2V  ^  arcs  corresponding  to  each  clause.  In  general,  arcs 


"belonging"  to  different  clauses  overlap,  i.e.,  there  will  be  truth 
values  inconsistent  with  more  than  one  clause.  We  do  not  show  the 
multiplicity  of  these  arcs.  Thus,  G  will  in  general  contain  fewer  than 
b  *  2V  arcs.  We  are  interested  in  G  because  P  the  probability  E  is 
satisfiable,  is  precisely  equal  to  the  probability  that  G  is  not  con¬ 
nected.  Figure  C.l  shows  a  truth  assignment  graph. 

The  problem  of  whether  a  random  graph  is  connected  or  not  has 
been  studied  extensively  by  Erdos  and  Renii.  We  will  proceed  along 
similar  lines. 


E  =  (v,  +  v2  +  v3)  (v,  +  v2  +  v4  )  (v2  +  v3  +  v4 ) 
FIGURE  C.l.  A  TRUTH  ASSIGNMENT  GRAPH 


We  define  A(v,  k,  b)  to  be  the  number  of  Boolean  expressions 
whose  corresponding  truth  assignment  graphs  contain  k  or  more  isolated 
nodes  (i.e.,  nodes  with  no  incident  links).  P  (v,  b)  is  then  given  by 


Ps(v,  b) 


1  (-l)k 

k=0 


A  ( v , 
( 


b) 


t  ( ) 
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where  B  =  8(3). 

This  relation  follows  from  the  fact  that  if  an  expression  is  satis- 
fiable  then  its  corresponding  truth  assignment  graph  must  contain  at 
least  one  isolated  node.  Note  that  the  alternating  sum  is  required  in 
order  to  account  for  the  situations  where  graphs  with  more  than  k 
isolated  points  are  included  in  terms  with  k  or  more  isolated  points.  We 
seek  the  number  of  expressions  with  graphs  with  one  or  more  isolated 
points.  The  alternating  sum  gives  us  the  number  with  exactly  zero 
isolated  points. 

We  can  write  A(v,  k,  b)  as 

v 

A(v,  k,  b)  =  I  (Y)  21  NM(v,  i,  k)  B(v,  i,  k,  b)  (06) 

i=0  1 

where  NM(v,  i,  k)  is  the  number  of  ways  of  selecting  k  clauses  which 
match  in  i  specific  variables  (i .  e . ,  k  clauses  in  which  i  specific  vari¬ 
ables  take  the  same  truth  value  in  all  k  clauses)  and  B(v,  i,  k,  b)  is 
the  number  of  Boolean  expressions  whose  corresponding  truth  assign¬ 
ment  graphs  have  k  or  more  isolated  points  and  the  isolated  points 
match  in  i  variables. 


NM(v,  i,  k)  = 


v-i  m  9v-i-m 
I  (-l)m  (2 
m=0  k 


)  (v_1)  2m 
m 


—  (2^ 
k!  u 


-  2) 


V-) 


(07) 


where  the  approximation  is  made  by  replacing 
9v-i-m  0v-i-rrKk 

<  k  >bviV-J- 

Now,  B(v,  i,  k,  b)  =  (c^vOk)  )  where  c(v,  i,  k)  is  the  total 
number  of  clauses  which  are  permissible  in  the  sense  that  they  give 


rise  to  graphs  that  have  the  requisite  number  of  isolated  points  and 
matched  variables.  c(v,  i,  k)  can  be  estimated  by 


c( v ,  i,  k)  s  8(3)  exp[- 


2k  k(v-i)  } 
(2k-2)8v 


(C-8) 


The  details  of  this  approximation,  which  are  somewhat  lengthy,  are 
given  in  W.  Chuang's  thesis. 

Using  the  above  approximations  and  Sterling's  approximation 
n!  s  V2n  NN+1*  e‘N 

we  find  by  algebraic  manipulation  that 

Ps(v,  b)  s  1  -  exp  [-exp  (-c)]  (C-9) 

where  b  =  8(£n  2V  +  c) 
and  c  is  a  constant. 


Thus,  only  for  b  2  5.5  v  does  P  (v,  b)  take  values  significantly 

b 

different  from  0  or  1  since  for  any  value  of  c  less  than  -2,  Pg  is  nearly 
1  and  for  any  value  of  c  greater  than  +4,  Pg  is  nearly  0.  Figure  C.2 
summarizes  the  relationship  between  Pg  and  c.  Note  that  this  is  not  a 
function  of  v  or  b  directly.  The  approximations  used  in  obtaining  this 
analytic  form  are,  however,  functions  of  v  as  we  will  see.  The  approx¬ 
imation  turns  out  to  be  reasonable  for  modest  values  of  v  (e.g.,  v  in 
the  range  10  to  30)  which  we  are  interested  in.  Note  especially  the 
sharp  drop  from  1  to  0  over  the  interval  c  =  -2  to  c  =  +4. 


C.3.  Analytic  Method  2 

Because  of  the  approximations  made  in  the  previous  analysis,  we 
sought  a  second  method  to  independently  verify  the  approach.  We 
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consider  the  case  now  wnere  clauses  are  selected  with  replacement.  We 
note  that  all  clauses  are  equally  likely  and  that  a  given  clause  elimi¬ 
nates  1/8  of  the  possible  truth  assignments.  Indeed,  because  of  this 
symmetry,  on  the  average  the  btb  clause  in  an  expression  eliminates  1/8 
of  the  remaining  truth  assignments  which  satisfy  the  first  b-1  clauses. 
We  say  a  truth  assignment  is  eliminated  if  it  does  not  satisfy  an  expres¬ 
sion.  Thus,  if  we  define  N  (v,  b)  as  the  number  of  truth  assignments 
satisfying  an  expression  with  v  variables  and  b  clauses,  we  have  the 
recurrence  relation 


Ng(v,  b)  =  Ng(v,  b-1)  -  2 


v-3  .  2  -  'Ns<v'  b-1}  „v-3 


=  l  Ns(v,  b-1 )  =  (g  )b  Ns(v,  0)  (C-10) 

where  Ng  is  the  expected  value  of  Ng. 

Since  N  (v,  0)  =  2V,  i.e.,  all  truth  assignments  satisfy  an  expres- 

b 

sion  with  no  clauses,  we  have 


Ns(v,  b)  =  (?)b  2V 


(C-ll) 


P  (v,  b)  is  by  definition  the  probability  that  Ng(v,  b)  is  greater 
than  zero.  Unfortunately,  we  only  have  an  expression  for  N  the 
average  value  of  Ng  and  not  for  the  value  of  Ng  itself. 

If  Ng(v,  b)  >>  1,  then  we  would  expect  Pg(v,  b)  s  1.  If  Ng(v,  b)  <  1, 
then  we  may  assume  that  N  (v,  b)  is  either  0  or  I .  Then 

P  (v,  b)  =  P(Ng(v,  b)  =  1)  =  ^s(v.  b).  Thus  we  use  the  approximation 

1  ,  Nfv,  b)  >  1 

P  (v,  b)  =  b  < ( : - 1 2 > 

Ng(v,  b),  N  (v,  b)  <  1  . 
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This  approximation  is  reasonably  good.  We  first  find  b  such  that 


Ng(v,  b)  =  1: 


(§)b  2V 


1 


b  log28  -  log27  ~  52  v 
We  then  have 


Ps(v,  b)  = 


1 


,7.b-5.2v 

V 


b  <  5 . 2v 
b  >  5.2v 


(C-13) 


(C-14 ) 


This  function  is  very  similar  to  the  one  obtained  by  the  first 
analytic  method  and  again  the  region  over  which  Pg(v,  b)  drops  effec¬ 
tively  from  1  to  0  is  limited  again  to  a  small  range  and  is  independent 
of  v  and  b.  Figure  C.3  illustrates  the  relationship  between  Pg  and  c 
(where  c  =  -  ~q5'2-  )• 

The  dotted  line  of  Fig.  C.3  shows  P  as  found  by  Method  1  for 

b 

the  same  values  of  c.  As  can  be  seen,  for  c  >  1  the  curves  are  vir¬ 
tually  identical.  There  is,  however,  a  5%  difference  Ff'fween  the  con¬ 
stants  relating  v  and  b  in  the  two  approximations.  Nevertheless,  the 
two  analyses  corroborate  eath  other  and  give  rise  to  the  same  analytical 
behavior  of  P  ;  i.e.,  v  and  b  are  linearly  related  at  the  point  where  P 
drops  and  Pg  decays  exponentially. 


C.4.  Simulation 

Finally,  in  order  to  compare  the  accuracy  of  the  two  analytic 
methods,  we  performed  a  simulation  to  measure  P  (v,  b)  directly.  In 
the  simulation,  clauses  were  generated  at  random  with  replacement,  all 
clauses  being  equally  likely.  Each  time  a  clause  was  generated,  the 
truth  values  it  eliminated  were  eliminated.  This  procedure  was  con- 
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tinued  until  no  truth  values  remained.  For  each  value  of  v,  the  value 
of  b  at  which  the  last  remaining  truth  value  was  eliminated  was  re¬ 
corded  and  became  a  sample,  b..  An  estimate  of  P  (v,  b)  is  then 

l  I) 

formed  by 


(#  of  b.  >  b) 

ps<v-  b)  '  (total  »  of  b,)  (t>15) 

Figure  C.4  gives  the  results  of  this  simulation  for  v  =  10,  20  and  30. 
As  can  be  seen,  the  true  value  of  P  (v,  b)  lies  between  the  N  values 
predicted  by  the  two  analytic  methods  and  is  somewhat  closer  to  that 
predicted  by  the  second  method.  The  important  property  of  rapid 
decay  from  1  to  0  is  borne  out. 
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Second-Order  Greedy  Algorithms  lor  Centralized 
Teleprocessing  Network  Design 

A  KERSilbNHAUM.  Ml  \:in  |<  ||  |  1  |;  H(  X  ■  U  S  I  > 

MI-MBIK,  llll.  SMI  l<  .ll'l’l  Mil  |S| 

Abstract — Wt  consider  the  problem  of  designing  4  centr.ili/ed 
telecommuniciidon  network  comprised  of  multipoint  lines  given  4  set 
of  terminal  locations,  traffic  requirements,  and  a  common  ventral 
lite.  The  optimal  solution  to  this  problem  is  a  capacitated  minimal 
spanning  tree.  Vtr  develop  a  class  of  heuristic  algorithms  for  the 
solution  of  this  problem  hy  imbedding  existing  heuristics,  relerred  to 
as  first-order  greedy  algorithms,  inside  a  loop  where  small,  carefully 
chosen  sets  of  arcs  are  alternately  forced  in  and  out  of  the  solution 
The  resultant  procedure  is  shown  to  be  superior  to  existing  tech¬ 
niques,  producing  solutions  typically  2  percent  belter,  while  requiring 
only  a  modest  amount  of  additional  computer  lime. 

INTRODUCTION 

The  problem  considered  is  that  of  finding  an  optimal 
(minimum  cost)  design  for  a  centralized  telecommunication 
network  given  a  set  of  terminal  locations,  traffic  magnitudes 
between  these  locations,  and  single  common  source  or  destina¬ 
tion  (central  site).  In  order  to  retain  simplicity  and  low  cost  in 
the  terminal  hardware,  such  networks  are  configured  as  trees 
comprised  of  communication  facilities  of  a  single  capacity. 
Thus,  the  optimal  solution  to  this  problem  is  a  capacitated 
minimal  spanning  tree  (CMST),  i.e.,  a  tree  of  minimum  total 
length  satisfying  a  constraint  or  set  of  constraints  that  limit 
the  total  traffic  and/or  number  of  nodes  in  any  subtree  rooted 
at  the  central  site.  In  centralized  networks,  such  subtrees  are 
called  multipoint  lines 

More  formally,  we  are  given  a  set  of  N  locations  (nodes),  in 
addition  to  the  central  site ,  a  symmetric  distance  measure  D  = 
{d,j  1 1,  /'  =  0,  I,  A'},  giving  the  cost  between  any  pair  of 
locations,  and  a  constraint  Af  on  the  total  number  of  nodes  or 
traffic  ir  a  multipoint  line.  We  seek  a  spanning  tree  T  rooted  at 
the  center,  satisfying  the  constraint,  and  of  minimum  total 
length 

This  formulation  is  quite  general.  All  we  require  of  the  cost 
function  is  that  the  cost  of  a  link  d,t  not  depend  upon  what 
other  links  arc  present  in  the  solution.  The  constraint  is,  like¬ 
wise,  quite  general.  One  can,  in  fact,  have  more  than  one  con¬ 
straint,  and  the  constraints  may  be  on  total  traffic,  number  of 
nodes,  nodal  degree,  number  of  links  in  cascade,  or  any  other 
quantity  associated  with  the  design.  We  require  only  that  if 
some  subtree  S  does  not  satisfy  the  constraints,  then  no  other 
subtree  S  ,  containing  S,  satisfies  the  constraints.  We  also  as¬ 
sume  that  a  star  solution,  i.e.,  all  nodes  connected  directly  to 
the  center,  is  feasible.  This  can  always  be  satisfied  by  split¬ 
ting  a  location  into  two  or  more  nodes 

Several  heuristics  and  optimal  techniques  have  been  devel- 
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oped  lor  the  solution  of  this  problem  I  he  optimal  le. 

111-131  are  bran  li  and  bound  prove  ones  vsuich  n  , 
have  running  tones  winch  ate  i  .xpon'  1  nal  in  ihc  ,r.:  ■  ,-t 
nodes  While  they  are  thus,  not  pr.iclic..!  as  design  pi 
results  obtained  using  then:  I  in  par  I  n  ul  o  a  1  c,  cut  1 .  : .  ■ 
procedure  [  7  |  )  have  led  to  some  insight  into  the  rctuu 
heuristic  procedures  The  procedure  described  below  0  .' 

growth  ot  this  work. 

Currently,  the  most  widely  used  procedures  for  the  solution 
of  the  CMST  problem  are  heuristics  ( 3  j .  [4 ) ,  [  8  | ,  [  1 0 1  w  hie n 
produce  solutions  within  50  percent  ot  the  optimum  (it 
where  the  optimum  is  known)  and  winch  have  running  'w  .  ■ 
which  are  a  low-order  polynomial,  generally  between  qua.:: 
and  cubic,  in  the  number  ot  nodes.  Karnaugh  i b  1  ha .'!■>  . 
a  family  of  second-order  greedy  algorithms  iSOfi.Vsi  a';..: 
iterate  the  above  heuristics  (which  he  refers  to  as  I-  (::r  (i 
OGA’s),  and  have  longer  running  tames,  but  produce  re  .nits 
generally  2-3  percent  better  than  the  above  heurisUvs  l  he 
procedures  described  below  are  variants  ol  trie  general  v  ><  ,  \ 
procedures  described  by  Karnaugh,  produce  results  ot  .  m 
parable  quality,  and  are  considerably  ,'astet  than  11  is  . 

in  many  cases  their  running  times  are  competitive  wp.,  the 
simpler  procedures  (  FOGA’s)  previously  used 

PROCEDURAL  DESCRIPTION 

The  most  often  used  heuristic  solutions  to  the  CMST  prob¬ 
lem  share  the  following  properties:  1)  their  runnuig  time  :s  a 
polynomial  in  the  number  of  nodes,  2)  in  the  absence  ol  con¬ 
straints.  they  will  yield  a  minimum  spanning  tree  (MST).  and 
3)  the  quality  of  the  solution  (i.e.,  the  amount  by  which  it  .fit 
fers  from  the  optimum)  is  not  controllable  and,  excel  :  -  cr\ 
loosely  ,  is  not  known. 

The  basic  heuristics  used  to  solve  CMST  problems  cat:  re¬ 
divided  into  two  categories  -  primal  procedures  winch  seer,  to 
improve  a  feasible  starting  solution  or  partial  solution,  ar.u 
dual  procedures  which  seek  to  make  a  low  cost  (infeasible; 
starting  solution  feasible.  We  concentrate  on  primal  proce¬ 
dures,  having  found  them  to  be  genctaily  more  flexible  it 
has  been  shown  (81  that  most  such  procedures  fall  within  the 
framework  of  the  following  scheme. 

1)  Start  with  each  node  on  a  separate  multipoint  line 
directly  connected  to  the  center  Associate  J  weight  w,  with  cuvh 
node  1  by  applying  a  given  rule  fw-rule).  For  each  potential 
Lr.k  interconnecting  a  pair  of  nodes  <  and  /,  define  a  trade- 
olf  function  t„  as  —  », 

2)  Consider  the  inot  previously  considered)  /.,,  for  winch 

is  minimum.  If  nod.-?  1  and  /  are  in  separate  multipoint  lines 

and  the  subtree  formed  by  merging  these  lines  does  not  violate 
,tny  vonstrauit,  then  add  /.,;  to  the  network,  replacing  or 
wluchever  is  more  costly.  II  not,  reject  /. 

3)  Update  the  >•-,  and  and  return  to  Step  I  until  no 
turthei  gain  can  be  obtained.  Usually  ,  w,  u,  for  nodes 
1  and  in  the  same  multipoint  line. 

1  he  implementation  o1'  such  a  procedure  has  been  considered. 
:n  detail  (9)  a  careful  implementation  is  shown  to  be  o. 
ordet  A  *  log2  A  where  A  is  tl  number  ot  locations  and  it  one 
wishes  to  consider  all  branches  It  is  ot  order  A  A  log,  n  one 
only  examines  the  A  nearest  neighbors  of  eaJi  node  an, I  :l  the 
w  rule  itself  is  not  too  computationally  complex  In  practice 
the  most  widely  used  »  rule  is  to  set  the  weight  ot  each  node 
in  a  subtree  to  be  the  minimum  distance  between  any  node  in 
the  subtree  and  the  center  1  Ins  is  known  as  the  1  sau -Williams 
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algonthm  Karnaugh  (6]  described  an  alternate  implementa¬ 
tion  of  the  F.sau-Wiliianis  procedure  which  has  a  different 
computational  complexity,  generally  between  quadratic  and 
cubic,  but  which  produces  identical  results  While  the 
SOClA’s  described  below  can  work  with  any  imbedded  FOGA, 
for  the  sake  of  comparison  with  previous  work,  we  performed 
experiments  with  an  imbedded  Esau-Williams  algorithm. 

In  order  to  implement  a  SOGA,  one  must  decide  which  arcs 
one  will  attempt  to  force  in  or  out  of  the  solution.  Karnaugh 
suggested  two  possibilities  which  are  briefly  described  below. 
The  reader  is  referred  to  (6}  for  a  complete  description. 

1)  Inhibit- At  each  stage  in  its  execution,  the  FOGA  brings 
in  one  arc  connecting  two  previously  unconnected  sets  of 
nodes.  The  Inhibit  loop  successively  prevents  each  of  the 
mergers  which  took  place  in  the  previous  iteration  from  taking 
place.  Thus,  one  iteration  of  the  Inhibit  loop  involves  up  to  A' 
(where  A  is  the  number  of  terminals)  iterations  of  the  FOGA, 
each  of  which  prevents  a  single  pair  of  node  clusters  from 
merging.  At  the  end  of  each  iteration  of  the  Inhibit  loop,  the 
best  of  the  generated  solutions  (and  its  associated  cluster 
inhibitions)  is  kept  and  used  in  place  of  the  original  (unhibited) 
FOGA  solution,  and  the  Inhibit  ioop  is  repeated  until  no 
further  progress  is  made. 

2)  Join-For  each  node  a,,  find  its  nearest  neighbor  b,  and 
nearest  neighbor  among  those  nodes  which  are  closer  to  the 
center  than  at,  c,\  Successively,  one  arc  at  a  time,  force  arcs  (a,, 
b,)  and  (a,,  c,)  into  the  solution  if  they  are  not  already  present. 
One  Join  loop  consists  of  a  sequence  of  executions  of  the 
FOGA,  each  with  a  single  forced  arc.  The  Join  loop  is  iterated, 
starting  from  the  best  solution  obtained  during  the  previous 
loop,  until  no  further  progress  is  made.  The  number  of  FOGA 
iterations  in  a  Join  loop  is  bounded  from  above  by  2N  —  1 
but  is,  on  the  average,  somewhat  leas  than  A,  the  exact  number 
being  a  function  of  the  particular  problem. 

Thus,  both  the  Inhibit  and  Join  procedures  have  running 
times  of  order  CA  times  the  running  time  of  the  embedded 
FOGA,  the  Join  procedure  being  somewhat  faster.  The  factor 
C  is  the  number  of  iterations  required  for  convergence,  i.e., 
to  reach  the  point  of  no  further  progress.  It  would  be  possible 
to  greatly  improve  the  running  time  of  the  SOGA  if  one  could 
restrict  one’s  attention  to  a  small  (i.e.,  <N)  subset  of  forced 
uiclusions  or  exclusions.  Also,  Karnaugh  mentioned  that  the 
Join  procedure,  wlule  somewhat  faster  than  Inhibit,  is  weaker 
because  it  restricts  itself  to  local  transformations  involving 
nearest  neighbors. 

The  procedure  described  in  [7|  is  a  branch-and-bound 
algonthm  for  obtaining  optimal  solutions  to  CMST  problems. 
While  it  does  not  converge  to  an  optimal  solution  quickly 
enough  to  be  useful  as  a  design  procedure,  it  was  sufficiently 
effective  to  be  useful  ;n  obtaining  optimal  solutions  to  a  num¬ 
ber  of  problems  of  sufficient  size  and  constraint  tightness  to 
be  able  to  make  observations  about  characteristics  of  optimal 
solutions  This  led  to  the  characterization  of  arc  subsets  which 
are  small  and  at  the  same  time  reasonably  effective  for  use  as 
candidates  ui  a  SOG a  In  particular,  it  was  found  that  arcs 
present  in  ot  lima]  solutions  but  not  present  in  heuristicaily 
generated  solutions  were  almost  always  MST  arcs  Wlule  it  is 
not  necessarily  so  that  all  arcs  present  in  the  optimal  solution 
and  not  in  the  I  sau-H  .i'.  .ms  solution  are  MST  arcs,  in  all  tm 
experiments  run  and  clicir.ru  to  this  end,  invariably  at  least 
one  arc  in  the  optimum,  but  not  m  the  Lsau-Williains  solution, 
was  an  M  SJ  arc 

IIils  is  not  to  say  that  it  is  conjectur'd  that  one  must  ai 
ways  be  able  to  find  an  optimum  solution  winch  contain:,  .n 


MST  arc  not  part  of  the  I.sau-WilUams  solution  (when  the 
f-.sau-Willianis  solution  itself  is  not  optimal)  It  merely 
strengthens  tile  point  that  MS  I  arcs  aie  reasonable  arcs  to  ex¬ 
amine  when  seeking  tc  improve  a  heuristic  solution. 

1  ven  without  this  evidence,  however,  it  is  intuitively  ap¬ 
pealing  to  consider  such  arcs  Most  prunal  heuristics  share  the 
property  that  they  proceed  by  connecting  nodes  to  their 
nearest  feasible  neighbors  (8]  The  algorithms  differ  only  in 
the  order  in  which  they  consider  the  nodes.  Thus,  the  solu¬ 
tions  differ  only  in  that  the  distance  to  a  node's  nearest  feasi¬ 
ble  neighbor  may  differ  depending  upon  when  the  node  is  con¬ 
sidered  during  the  execution  of  the  algorithm.  If  such  a  heuris¬ 
tic  fails  to  find  the  optimum,  it  will  be  because  the  algorithm 
erred  by  waiting  too  long  before  consideration  of  some  critical 
node  (or  nodes),  and  lienee  missed  being  able  to  include  some 
desirable  arc  lor  arcs)  in  Hie  solution  It  is  quite  likely  that  one 
or  more  ol  these  arcs  will  be  MST  arcs,  indeed,  as  mentioned 
above,  this  was  always  the  case  with  the  experiments  carried 
out. 

Thus,  a  heunstic  is  proposed  which  attempts  to  improve 
solutions  generated  by  some  prunal  heuristic  by  forcing  the 
inclusion  of  one  or  more  MST  arcs  which  the  primal  heuristic 
left  out.  This  gives  nse  to  the  following  procedure. 

Step  1 .  Generate  an  MST  of  arcs  Tm  =  {a;  |/  =  1,  -.A—  l}. 
Generate  a  feasible  tree  of  arcs  Tf  =  {6j|/  =  1,  A  —  I}. 
Find  5  =  Tm  —  (Tm  n  T f).  (For  problems  of  interest,  5  is 
nonempty.) 

Step  2:  For  each  subset  5,  C  5,  set  53  =  5—5 1.  Remove 
all  aj  €  5 j  from  the  network.  Start  a  solution  by  including  all 
a,  6  5 1  Apply  the  primal  heuristic  to  complete  the  generation 
of  a  solution. 

It  should  be  noted  that  an  alternative  procedure  would  be 
to  include  a,  in  5| ,  but  not  to  exclude  a,  in  52.  It  was  felt, 
however,  that  if  elements  in  52  were  retained  as  candidates  for 
inclusion,  duplicate  solutions  would  be  likely  to  result.  The 
exclusion  of  elements  in  52  guarantees  unique  solutions,  and 
hence  should  increase  the  likelihood  of  generating  improve¬ 
ments. 

COMPUTATIONAL  EXPERIENCE 

In  order  to  assess  the  effectiveness  and  efficiency  of  the 
procedural  scheme  described  in  the  previous  section,  a  series 
of  experiments  was  run.  Nodes  were  generated  with  random 
locations  over  a  unit  square  and  unit  traffic;  problems  with 
A'  =  40,  60,  90,  and  1  20  nodes  were  generated,  and  constraints 
on  the  number  of  nodes  in  subtrees  varying  between  m  =  A/20 
and  m  =  A'/4  were  examined. 

Alternatively,  a  constraint  on  traffic  could  have  been  used, 
but  we  did  not  feel  that  significantly  different  results  would 
have  been  obtained  Our  first  objective  was  to  examine  the 
running  time  of  our  implementation  of  the  Esau-VV’iUiams 
Algorithm  within  the  context  of  the  Unified  Algonthm  [81. 

Karnaugh  |t>)  desenbed  an  alternate  implementation  of  the 
Esau-'-Villianis  procedure,  and  reports  on  its  running  time  on 
an  IUK  ??0/l58.  Our  experiments  were  done  on  a  DEC 
KL  20/50  which  is  somewhat  slower  than  a  370/158.  In 
order  to  establish  a  basis  for  the  run  time  comparisons  with 
Karnaugh's  procedures,  shown  in  l  ig  2  and  Table  I,  we  first 
compared  our  implementation  of  the  FOGA  (of.  (8))  with  his. 
<W  found  tiial  both  pro.edures  ha.!  c  impanhle  run  times  and 
computational  complexil.es  (between  quadratic  and  cubic) 
for  the  nrst  FOGA  iteration  lie  .  (or  obtaining  an  initial  solu¬ 
tion).  but  that  ou’  uuplenu  t.GIk  n  w  as  ol  a  lower  order  of 
loinpOv.t)  .oi  laicr  in  raiivui:  !  in-  .nllcrc'ce  in  running  times 
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for  later  iterations  was  roughly  a  factor  of  5/2  for  A  --  120 
Since  most  of  the  execution  time  ot  the  o>erall  procedure  is  in 
iteration  of  the  FOG  A.  we  conclude  that  tins  coincides  with 
the  differences  m  running  times  between  the  two  versions  of 

Inhibit  in  Fig  2. 

Next,  by  running  a  sequence  of  120  problems  for  A  be¬ 
tween  10  and  100,  we  examined  the  size  of  the  subsets  S  gen¬ 
erated,  i.e.,  the  average  number  of  MST  links  not  present  in 
the  FOGA  solution.  Most  primal  heuristics  generate  subtrees 
which  axe  MST’s  on  the  set  of  nodes  they  contain  along  with 
the  center.  Most  of  the  arcs  in  these  subtrees  (with  the  excep¬ 
tion  of  the  arc  directly  connected  to  the  center)  will  be  MST 
arcs.  Thus,  one  might  expect  that  the  cardinality  of  S,  IS  I  =» 
N/m,  and  a  vast  improvement  can  be  made  over  a  blind  branch 
exchange  or  SOGA  procedure.  In  practice,  we  found  that  I S  ] 
ranged  between  N/m  and  N  log  N/m . 

There  are,  in  fact,  2"^'  subsets  of  S.  Thus,  for  moderately 
Urge  tightly  constrained  problems,  S  could  grow  large  enough 
to  make  evaluating  all  subsets  impractical.  Furthermore,  it  is 
reasonable  to  assume  that  not  all  arcs  in  S  interact  with  one 
another  i.e.,  improvements  in  separate  parts  of  the  network 
can  be  found  and  justified  independently  of  one  another. 
Thus,  the  heuristic  was  modified  to  only  consider  subsets 
St  C  S  such  that  |£t  |  <  K  for  some  given  A".  The  best  subset 
Si  •  is  found  and  permanently  forced  into  the  solution.  We 
then  set  S  =  S  —  S  (  *  and  repeat  the  procedure  until  no  further 
improvement  can  be  made.  In  practice,  K  -  7  worked  well. 
Experiments  were  run  with  larger  values  of  K ,  only  in 
isoUted  cases  was  any  improvement  over  A'  =  2  obtained. 
(Even  K  =  1  worked  well  in  many  cases.)  Thus,  it  was  decided 
in  all  the  remaining  experiments  to  use  Um  extended  proce¬ 
dure  and  restrict  the  examination  to  subsets  of  cardinality  less 
than  or  equal  to  two 

Using  this  modification  of  the  new  heuristic,  it  was  observed 
that  forcing  arcs  between  nodes  which  are  close  to  the  center 
seemed  to  have  the  greatest  effect  on  the  value  of  the  solu¬ 
tion.  This  was  probably  a  consequence  of  the  fact  that  the 
Esau-Williams  algorithm  starts  with  nodes  from  the  center, 
and  hence,  dealing  with  nodes  near  the  center  first  radically 
changes  the  solution  value. 

In  particular,  it  was  found  that  for  small  networks  ( N  <  20). 
there  was  little  difference  (usually  =£  1  percent)  in  perform¬ 
ance  between  the  new  heuristic  and  the  Esau-Williams  solu¬ 
tion.  This  is  almost  certainly  because  both  procedures  were 
generating  near-optimal  solutions  However,  for  larger  net¬ 
works,  particularly  for  tightly  constrained  problems,  the  new 
heuristic  performed  noticeably  better  with  improvements  aver¬ 
aging  about  1.5  percent.  Fig.  1  shows  the  improvement  relative 
to  the  Esau-Williams  algorithm.  As  can  be  seen,  the  improve¬ 
ment  increases  with  the  problem  size  Results  were  even  more 
encouraging  for  large  networks  with  the  central  site  in  Uie 
comer.  Such  networks  may  be  viewed  as  one-fourth  of  a  net¬ 
work  of  4,V  nodes  with  the  central  sue  in  the  center  For  such 
problems,  improvements  averaging  4  percent  and  as  high  as 
8  percent  were  observed.  Thus,  the  procedure  appears  to  be  of 
value  for  many  realistically  sized  problems  with  tight  con¬ 
straints. 

A  straightforward  implementation  of  the  Fsau-W  dliams 
procedure  has  a  computational  complexity  of  order  ,V‘  log;  N 
A  more  careful  implementation  ( 9]  can  reduce  the  complexity 
to  order  A’  log;  .V  As  discussed  previously,  '5|  w as  found  to 
range  between  N/m  and  N  log  N/m  Since  we  cxanunc  subsets 
of  cardinality  at  most  2.  1  subsets  are  examined  on  each 

iteration  At  worst.  |S|/2  successive  subsets  of  cardinality  2 
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Fig.  1.  Improvements  obtained  with  new  heuristic. 


Fig.  2.  Comparison  of  run  times. 

could  be  introduced,  and  thus,  the  procedure  could  iterate  at 
most  151/2  times.  In  practice,  the  number  of  iterations  grows 
more  slowly  than  1 5  |. 

Thus,  a  careful  implementation  of  the  procedure  has  a 
complexity  of  |5  1 3 A'  log  A'. 

Finally,  a  version  of  the  Inhibit  and  Join  procedures  (as 
described  in  [6]  ,  but  using  the  Unified  Algorithm  in  place  of 
the  alternate  implementation  of  Lsau-Williams)  was  coded  and 
run  on  another  set  of  problems,  and  the  heuristic  described  in 
this  paper  was  then  run  on  the  same  set  of  problems.  Thus,  we 
were  able  to  directly  compare  the  quality  of  the  obtained  solu¬ 
tions,  as  well  as  the  running  times  of  the  new  heuristic  with 
Inhibit  and  Join.  The  running  times  are  summarized  in  Fig.  2 
and  Tables  I  and  II.  As  can  be  seen  the  new  heuristic  is  much 
faster  than  Intubit  and  faster  than  Join.  Indeed,  it  is  only  two- 
three  times  slower  than  the  Unified  Algorithm. 

The  quality  of  the  solutions  obtained  varied.  Overall  the 
results  were  consistent  with  the  experiment  described  above, 
although  a  drop  in  effectiveness  is  noted  tor  the  120-node 
problems.  W'c  do  not  consider  tins  latter  phenomenon  signifi¬ 
cant  since  there  was  a  large  variation  in  individual  runs,  and 
the  first  experiment,  which  was  base  1  on  a  much  larger  nuni 
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PERCl  NT  1M  HROVF  M  ENTS  IN  PERFORMANCE  OVER 
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Number  of  Node* 

Algor  ithm 

40 
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60 

•■*0 
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120 

4ver*}» 

L*i  L  i  b  1 1 

2  Z 

i.  o 

3  fc 

2  b 

2  e 

New  Heart *t»c 

2.  4 

1  s 

2.  b 

C.  ti 

l  * 

Joui 

‘  — — 

l.  1 

1  s 

2.  0 

0.  > 

1  t 

'  er  t>:  problems.  exhibited  no  similar  behavior  In  some  cases, 
the  new  heuristic  outperformed  Inhibit,  tn  others,  Inhibit 
performed  better.  On  the  whole,  the  quality  of  the  solutions 
obtained  with  Inhibit  was  2.6  percent  better  than  those  ob- 
t. line-.;  using  the  i-sau-Wiliiams  procedure,  while  those  obtained 
using  the  new  heuristic  were  1.9  percent  better  than  Esau- 
"  alums.  Batli  Inhibit  and  the  new  heuristic  outperformed 
Join  lairiy  consistently ,  Joui  averaged  a  1  b  percent  unprove- 
:  ,ent  over  Lsau-Williams. 

CONCLUSION 

We  found  the  new  heuristic  to  be  of  definite  value  as  it 
■  :  tamed  solutions  roughly  2  percent  better  than  the  Lsau- 
W  till  a  ms  procedure  without  greatly  increasing  the  running 
tune.  In  comparison  with  Kamough's  SOtiA's  the  new  heuris¬ 
tic  is  much  faster  and  obtains  solutions  somewhat  better  than 
Join  and  somewhat  worse  than  Inhibit.  We  conclude  that  the 
new  heuristic  is  uselul  as  a  practical  design  procedure,  even 
when  imbedded  ui  a  larger  procedure  which  solves  more 
global  problems 
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Abstract 

The  problem  considered  is  that  of  finding  an  optimal  (minimum 
cost)  design  for  a  centralized  processing  network  given  a  set  of 
locations,  traffic  magnitudes  between  these  locations,  and  a  single 
common  source  or  destination.  Several  heuristics,  which  are  efficient 
(in  terms  of  their  execution  time  and  memory  requirements  on  a 
digital  computer)  and  which  produce  seemingly  good  results,  have 
already  been  developed  and  are  currently  accepted  techniques.  Some 
work  has  also  been  done  on  finding  optimal  solutions  to  this  problem 
both  as  a  design  tool  and  as  a  means  of  verifying  the  effectiveness  of 
proposed  heuristics.  We  focus  in  this  latter  area.  Currently  known 
techniques  for  the  optimal  solution  of  this  problem  via  integer  pro¬ 
gramming  have  fallen  short  of  the  desired  objectives  as  they  require 
too  much  memory  and  running  time  to  be  able  to  treat  problems  of 
realistic  size  and  complexity.  We  develop  an  improved  technique 
which  is  capable  of  handing  more  realistic  problems. 
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Contract  No.  DAAK  80-80- k-0579  ,  and  bv  the  National  Science  Poun- 
dation,  Grant  No.  RNG-7908120 


1  .  INTRODUCTION  AND  PROBLEM  STA  1'I.Ml'AT 

Iho  problem  considered  is  that  ot  finding  an  optimal  (minimum 
cost.)  design  tor  a  centralized  telecommunication  network  given  a  set 
of  locations,  traffic  magnitudes  between  these  locations,  and  a  single 
common  source  or  destination.  The  vast  majority  ot  telecommunication 
networks  currently  in  existence  are  ol  this  type.  Thus,  this  problem 
has  been  much  studied  (2,3,4,5,8,9,12,16,24,28,31,32). 

Several  heuristics,  which  are  efficient  (in  terms  of  their  execu¬ 
tion  time  and  memory  requirements  on  a  digital  computer)  and  which 
produce  seemingly  good  results,  have  already  been  developed  and  are 
currently  accepted  techniques.  Some  work  has  also  been  done  on 
finding  optimal  solutions  to  this  problem  as  a  means  of  verifying  the 
effectiveness  of  proposed  heuristics.  Currently  know  techniques  for 
the  optimal  solution  to  this  problem  via  integer  programming  have 
fallen  short  of  the  desired  objective  as  they  require  too  much  memory 
and  running  time  to  be  able  to  treat  problems  of  realistic  size  and 
complexity.  We  develop  an  improved  technique  which  is  capable  of 
handling  problems  of  realistic  size. 

More  formally,  the  problem  considered  here  is  that  of  finding  a 
minimum  spanning  tree  subject  to  one  or  more  constraints  which  in 
qeneral  are  equivalent  to  demanding  that  the  sum  of  the  traffic  asso¬ 
ciated  with  the  nodes  in  any  subtree  must  not  exceed  some  predeter¬ 
mined  maximum. 

A  minimum  spanning  tree  is  a  loop-lree  collection  of  arcs  |0imng 
a  set  of  nodes  such  that  the  sum  of  the  lengths  of  the  arcs  is  mini¬ 
mal.  In  the  case  of  a  communication  network,  these  collections  of 
arcs  are  called  multidrop  lines. 
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It  should  be  noted  that  this  constraint  form  is  quite  general  and 
encompasses  many  real-world  constraints  which  arise  in  the  design  ot 
centralized  telecommunications  networks.  Thus,  for  example,  in 
addition  to  treating  the  obvious  constraint  imposed  by  line  capacity, 
it  is  possible  to  treat  a  restriction  on  the  number  of  terminals  on  a 
multidrop  line  by  associating  a  uniform  traffic  with  each  terminal. 
Also,  the  length  (cost)  functions  which  can  be  treated  are  quite 
general.  Any  function  which  is  not  a  function  of  the  tree  chosen  is 
permissable . 

Formally,  we  seek  to  solve  the  following  problem: 

Given 

1.  A  vertex  (node)  set  V  =  { Vj  |  i=0 , 1 , . . .  , n }  representing  the 

terminal  locations  in  the  network.  Node  vQ  is  a  distin¬ 

guished  node  which  we  will  refer  to  as  the  center. 

2.  A  symmetric  function  giving  the  length  (cost)  d^  of  an  arc 
between  any  pair  of  locations. 

3.  A  constraint,  m,  on  the  number  of  nodes  which  may  share  a 

multidrop  line.  This  constraint  can  be  generalized  to  allow 

a  weight  or  traffic,  Cj,  to  be  associated  with  each  node  and 

to  require  that  the  sum  of  the  weights  associated  with  the 

nodes  on  any  multidrop  line  not  exceed  m. 

t  h 

We  define  the  set  of  nodes  in  the  j  multidrop  line  to  be  \T  and 

the  multidrop  line  itself  to  be  a  minimal  spanning  tree  T^  on  V.^{vq}. 

1 

Thus,  the  constraint  can  be  stated  in  terms  of  the  cardinality  of  \A 

as  | V . |  <  m  V..  In  the  more  general  form,  the  constraint  would  be 
J  )  * 

2  c  ^  m  V  .  We  wish  to  find  a  tree,  Tw  of  minimum  total 
i  j  V 

v.  r.  V 

i  J 

length  satisfying  the  constraint  in  3  above. 


•• 


That,  is,  we  wish  to 


is  the1  immediate  predecessor 


N 

minimize  ^  d  suh]ert  to  1 ,  where  vu 
i=l  u  i 

of  v.,  i.e.,  the  node  closest  to  v  on  the  path  between  v.  and  vM  in 

T,  and  T  is  any  spanning  tree.  We  consider  exact  (optimal)  solutions 

to  this  problem.  The  primary  motivation  for  the  work  is  to  develop 

an  exact  algorithm  capable  of  permitting  study  ol  the  performance  ot 

heuristics  on  a  broader  class  of  problems  than  was  previously  studied, 

to  gain  insight  into  the  performance  of  both  exact  and  heuristic- 

procedures  and,  in  particular,  to  pinpoint  where  and  why  they  fail. 


II.  OLTl.lNi:  OF  A  NLW  OPTIMAL  SOLUTION  tf.chniquf. 

Thert'  rurrenty  exist  seN.cf.il  techniques  which  will  yield  optimal 
solutions  to  the  UMST  problem.  These  techniques  can  be  divided  into 
two  classes  -  branch  exchange  methods  (as  proposed  by  Tin  (22)  and 
Frank  (9))  and  branch  and  bound  methods  (10,22).  We  concentrate 
on  the  latter  class  of  techniques. 

The  specific  application  of  branch  and  bound  techniques  to  the 
solution  of  the  CMST  problem  was  proposed  by  Chandy  and  Russell 
(3)  and  was  subsequently  refined  (2)  so  that  it  could  treat  somewhat 
more  meaningful  problems.  Subsequently,  F.iias  and  Fergus  on  (4) 
proposed  further  refinements  and  thereby  expanded  the  range  of 
applicability  of  the  technique.  (Lavish  (34)  recently  developed  a 
bound  using  Lagranqean  relaxation. 

The  basic  technique  is,  as  has  already  been  mentioned,  a  branch 
and  bound  algorithm.  The  original  problem  considered  has  all  branches 
in  the  category  "permissible,"  i.e.,  any  branch  may  or  may  not  be 
part  of  the  final  solution.  Subproblems  are  generated  by  selecting  a 
permissible  branch  and  making  it  "prohibited"  in  one  subproblem  or 
"required"  in  another. 

The  relaxation  used  is  simply  to  generate  a  modified  MST  by 
including  all  "required"  branches,  excluding  all  "prohibited"  branches, 
and  forming  the  tree  of  minimum  total  length  by  connecting  (as  yet 
unconnected)  nodes  using  remaining  ("permissible")  branches. 

Clearly,  a  solution  obtained  in  this  manner  is  a  lower  bound  on 
the  value  of  a  feasible  solution  to  the  subproblem  as  it  is  the  tree  of 
minimum  length.  Note  also  that  in  the  case  where  all  arcs  are  speci¬ 
fied  (prohibited  or  required),  the  lower  bound  and  solution  are 


identical  and  the  subproblem  fathoms.  In  general,  the  subproblem 
fathoms  when 

1.  No  feasible  solution  exists  to  the  subproblem.  This  occurs, 
when  the  required  branches  form  a  loop,  when  the  required 
branches  create  a  subtree  violatinq  the  constraints,  or 
when  the  prohibited  branches  disconnect  the  network. 
Other  criteria  exist  but  are  difficult  to  test  for. 

2.  The  lower  bound  equals  or  exceeds  the  value  of  the  best 
solution  found  thus  far. 

3.  The  lower  bound  solution  is  feasible. 

When  all  subproblems  have  fathomed,  the  current  best  solution  is 
the  global  optimum. 

A  number  of  observations  have  been  made,  which  can  be  used  to 
accelerate  a  basic  branch  and  bound  technique.  One  of  these,  which 
is  used  in  the  sequel,  is  given  in  Theorem  1  below: 

Theorem  1:  (3) 

If  branches  (v  v.  ),  ( v.  ,v.  ),...,  (vn,v.  ),  are  part  of 
)  i  u  )o  u  K 

some  MST,  T,  on  V  then  there  exists  a  CMST  including 

these  edges. 

Corollary : 

If  arcs  (v.,v.  ),  (v„,v.  ),...,  (vn,v.  )  are  present  in  the 
u  I]  ^  O  '  I  k 

modified  MST  produced  in  any  subproblem,  then  (it  any 
CMST's  exist  in  the  subproblem)  there  exists  a  CMST  on 
the  subproblem  containing  these  arcs. 

The  preceding  theorem  and  corollary  allow  one  to  avoid  consider¬ 
ing  subproblems  with  such  arcs  prohibited.  The  techniques  devel¬ 

oped  in  the  sequel  make  explicit  use  of  both  observations  as  well  as 
others  made  in  the  references  cited. 
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The  inherent  problem  with  the  existing  procedures  lies  in  the 
relaxation  method  used.  At  each  step,  the  problem  is  relaxed  to  a 
modified  MST.  Unfortunately,  this  bound  is  often  too  loose  to  elimi¬ 
nate  a  sufficiently  large  percentage  of  the  subproblems  to  make  the 
procedure  practical.  This  is  particularly  evident  when  the  constraints 
are  tight;  it  is  for  such  problems  that  the  relaxation  is  loosest. 
Unfortunately,  it  is  also  for  that  class  of  subproblems  that  the  known 
heuristics  display  the  widest  variation  in  the  quality  of  solutions. 

Note  that  any  optimal  solution  to  the  CMST  problem  has  the 
property  that  all  subtrees  are  MST's  on  the  set  of  nodes  contained  in 
the  subtree  and  the  center.  Thus,  it  suffices  to  find  the  optimal 
partition  of  the  nodes  into  subtrees.  The  technique  which  is  devel¬ 
oped  in  the  following  sections  wiil  thus  generate  partitions  of  the 
nodes. 

The  technique  works  within  the  framework  of  branch  and  bound 
algorithms,  as  did  the  techniques  referred  to  above.  We  develop  two 
algorithms,  one  based  on  generating  subproblems  by  restricting 
nodes,  and  the  other  based  on  generating  subproblems  by  restricting 
arcs.  These  techniques  differ  from  previous  ones  in  that  the  relaxa¬ 
tion  used  here  is  tighter  and  thus,  a  smaller  number  of  subproblems 
need  be  examined. 

We  begin  by  restricting  the  problem  slightly.  We  seek  a  CMST 
subject  to  the  constraint  that  the  number  of  nodes  (rather  than  the 
sum  of  the  weights  of  the  nodes)  in  any  subtree  not  exceed  a  pre¬ 
specified  maximum.  Since  our  primary  intent  here  is  to  study  the 
performance  of  CMST  algorithms,  this  modification  would  not,  in 
general,  have  a  significant  effect.  Indeed,  if  one  preferred,  a  node 
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of  weight  K  could  be  replaced  by  K  nodes  of  weight  1 ,  providing  one 
is  willing  to  allow  the  original  node  of  weight  K  to  be  split  among 
more  than  one  subtree. 

To  find  a  partition  of  the  nodes  V  =  [v.|  i  =  1,2, ...n}  into 
subtrees,  we  begin  by  making  n  copies  of  each  node  corresponding  to 
the  possibilities  of  the  node  being  in  any  of  n  possible  subtrees. 
Thus,  corresponds  to  node  v.  being  in  subtree  j. 

The  problem  of  obtaining  an  optimal  partition  of  nodes  into 
subtrees  can  be  thought  of  as  one  of  selecting  an  optimal  subset  from 
the  set  E  =  {v^l  i=l,2,...n;  j=l,2,...n}.  Feasible  subsets  of  E, 
i.e.,  those  corresponding  to  partitions  satisfying  the  capacity  con¬ 
straint,  will  contain  1  v..  for  each  i  and  at  most  m  v^'s  for  each  j. 
If  we  associate  a  weight,  w.j,  with  each  v-,  the  optimal  subset  of  E 
(hence  the  optimal  partition  of  V)  is  defined  as  the  feasible  subset  of 
minimum  total  weight. 

An  efficient  algorithm  (see  14,  21,  35)  exists  for  the  solution  of 
the  problem  of  finding  the  optimal  subset  of  E  given  the  values  of 
w^.  The  algorithm,  which  can  be  thought  of  as  a  matroid  intersec¬ 
tion  [1,17,19,20,29]  algorithm  or  alternatively  as  a  series  of  shortest 
path  problems  in  appropriately  defined  graphs,  has  a  worst  case 
running  time  of  order  n3  and  in  practice  has  a  running  time  closer  to 
order  n2  (see  35).  Unfortunately,  the  set.  of  weights,  w^,  which 
correspond  directly  to  the  "cost"  (contribution  to  the  overall  length 
of  the  CMST)  of  Vj  in  subtree  j  can  be  specified  only  when  the  pro¬ 
blem  solution  is  already  known.  We  can,  however,  define  a  set  of 
weights,  w. ^ ,  which  have  the  property  that  the  optimal  partition  found 
using  these  weights  will  have  a  value  (sum  of  weights)  w'hich  is  a 


« 


-8- 


lower  bound  on  the  length  of  the  optimal  CMST.  Thus,  we  can  relax 
the  CMST  problem  to  the  problem  of  finding  an  optimal  partition. 
This,  together  with  generating  subproblems  by  successively  restrict¬ 
ing  either  nodes  or  arcs,  gives  rise  to  an  optimum  CMST  algorithm 
within  the  branch  and  bound  context. 

An  appropriate  set  of  weights,  w..,  can  be  defined  as  follows. 
Suppose  we  are  given,  for  each  subtree  j,  the  set  V\  of  nodes  per¬ 
mitted  in  the  subtree;  a  procedure  for  obtaining  the  V.  will  be  given 
below.  One  can  then  find  '18,  the  minimum  spanning  tree  on  the 
nodes  in  V8  ^  {vq}.  Let  ^jj  *3e  len9th  of  the  arc  connecting  v^ 
to  its  predecessor  in  T.  ( i .  e .  ,  d^  is  the  length  of  the  last  arc  in  the 
path  from  vQ  to  v.  in  Tj).  The  following  theorem,  which  is  proven  in 
(35),  allows  us  to  obtain  appropriate  w. . : 

Theorem  2:  The  weight  of  the  optimal  partition  using  w..  =  d^  is  a 
lower  bound  on  the  length  of  the  CMST  for  the  same  V  and  M. 

Furthermore,  it  is  proven  in  (35)  that  other  similarly  defined 
Wj8s  also  preserve  this  lower  bound.  In  particular,  suppose  38 
contains  a  path  (v0----vp-  vq  '  •  •  •  vx  -  •  •  •  vy  -  •  •  • )  as  shown  in  Figure 
1.  Let  S  be  the  set  of  nodes  {  vq «  •  •  •  v(<  >  ■  ■  ■  VS1  and  let  w^  be  the 
largest  weight  of  any  node  in  S.  Suppose  w^  >  w^. .  Define  A  = 
Wpj  -  w^j.  Then  the  following  theorem  holds: 

Theorem  3:  If  a  set  of  weights  w..  =  d^  is  modified  by  transferring 
weight  A  from  Wp^  to  w^  where  A,  v^  and  vg  are  defined  as  above, 
the  weight  of  the  optimal  partition  is  still  a  lower  bound  on  the  length 
of  the  CMS3’  for  the  same  V  and  m. 


Theorem  3  allows  us  to  transfer  weight  from  a  node  to  its  suc¬ 
cessors  in  38  in  order  to  guarantee  that  the  lower  bound  obtained 
J 
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from  the  partitioning  problem  is  at  east  as  tight  as  the  bound  ob¬ 
tained  using  an  MST,  as  is  done  .1  (2),  (3),  and  (4).  A  proof  that 

this  can  always  be  done  is  giv  a  in  (35).  As  an  example  of  how  this 

works,  consider  the  network  shown  in  Figure  2a.  The  MST  for  this 

network  and  the  node  weights  corresponding  to  it  are  shown  in  Figure 
2b.  These  weights  correspond  to  the  bound  obtained  using  an  MSP. 
Suppose,  however,  that  we  restrict  v3  from  being  part  of  a  given 
subtree  j.  The  weights  shown  in  Figure  2c  would  then  be  obtained  if 
we  simply  set  w..  =  d...  Note  in  particular  that  w„.  has  been  reduced 
from  5  to  1 .  This  reduction  in  w^j  could  result  in  a  loosening  of  the 
lower  bound.  Theorem  3  allows  us  to  transfer  up  to  A  =  w^  -  W£j, 

i.e.,  7  units  of  weight,  from  w^  to  w^j  and  obtain  the  weights  shown 

in  Figure  2d.  Note  that  the  w.^  in  Figure  2d  are  at  least  as  great  as 
the  w.j  in  Figure  2b.  Thus,  the  lower  bound  obtained  using  the  w^ 
in  Figure  2d  will  be  at  least  at  tight  as  that  obtained  using  an  MST. 

In  fact,  the  bound  so  obtained  is  significantly  tighter,  as  is  shown 

by  the  computational  experience  given  in  Section  V. 

We  now  turn  to  the  question  of  the  branching  rule  within  the 
branch  and  bound  procedure.  Little  was  said  by  Chandy  and  Russell, 
Chandy  and  Lo,  and  Elias  and  Ferguson  on  the  order  in  which  sub¬ 
problems  are  considered  in  the  branch  and  bound  procedure.  Classi- 
cially,  two  approaches  are  available.  The  first  is  to  always  consider 
the  subproblem  with  the  least  lower  bound.  Alternatively,  one  can 
use  depth  first  search,  where  one  always  solves  most  recently  gener¬ 
ated  subproblems  before  returning  to  older  subproblems.  There  are 
advantages  and  disadvantages  to  both  approaches. 
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The  first  approach  allows  one  to  proceed  without  any  good 
feasible  solutions  to  guide  the  process.  The  assumption  is  that 
subproblems  with  the  lowest  lower  bounds  will  give  rise  to  the  best 
feasible  solutions.  Hence,  one  prefers  to  explore  these  subproblems 
first  in  the  hope  that  they  will  give  rise  to  low  cost  feasible  solutions 
which  will  eliminate  other  subproblems  (with  higher  lower  bounds) 
from  consideration.  Also,  by  examining  subproblems  in  this  order, 
one  is  continually  narrowing  the  range  between  the  upper  and  lower 
bounds,  and  hence,  has  the  option  of  terminating  the  algorithm  when 
the  interval  shrinks  to  some  prespecified  width. 

There  are,  however,  two  major  drawbacks  to  this  approach. 
First,  one  must  keep  (a  potentially  large  number  of)  subproblems 
around  in  order  to  select  the  next  one.  Thus,  the  storage  required 
for  the  procedure  is  potentially  exponential.  In  practice,  it  was 
storage,  not  running  time,  which  was  the  active  constraint  on  problem 
size  in  previously  developed  techniques.  One  could  temporarily  store 
subproblems  in  secondary  storage,  but  this  would  complicate  and  slow 
down  the  procedure. 

Second,  by  considering  problems  in  ascending  order  of  lower 
bound,  one  will,  in  general,  be  sequentially  considering  dissimilar 
subproblems.  Thus,  one  cannot  easily  take  advantage  of  information 
obtained  in  the  solution  of  one  problem  for  the  solution  of  another. 
For  example,  in  the  Elias  and  Terguson  technique,  the  similarity 
between  modified  MST's  for  related  subproblems  cannot  be  easily 
exploited  if  this  first  procedural  outline  is  adopted. 

Using  depth  first  search  overcomes  both  of  these  objections. 
Indeed,  a  great  deal  of  simplification  is  obtainable  both  in  the  genera- 


tion  of  subproblems  and  in  obtaining  solutions  owing  to  the  similarity 
of  successively  considered  subproblems. 

The  maximum  number  of  subproblems  which  need  be  kept  around 
at  any  time  is  bounded  by  the  number  of  nested  specifications  it  is 
possible  to  make.  Thus,  if  one  is  restricting  nodes,  the  bound  is  n; 
if  one  is  including  or  excluding  arcs,  the  bound  is  (£)■  This  essen¬ 
tially  eliminates  storage  as  an  active  constraint  on  the  size  of  the 
problem  which  can  be  considered. 

A  further  reason  for  using  depth  first  search  is  that  the  major 
reasons  one  would  ordinarily  choose  the  first  procedure  are  not 
present  here.  Any  of  the  existing  heuristics  can  be  used  to  quickly 
generate  a  good  upper  bound.  Furthermore,  the  procedures  devel¬ 
oped  in  the  sequel  lend  themselves  to  generating  feasible  solutions  for 
all  subproblems.  Thus,  a  good  upper  bound  is  always  available. 

Hence,  depth  first  search  is  used  in  developing  the  techniques 
in  the  sequel.  It  should  be  further  noted  that  the  philosophy  used 
in  developing  these  techniques  was  to  create  the  simplest,  most  flex¬ 
ible  framework  within  which  to  work  so  that  a  variety  of  acceleration 
techniques  could  be  developed  and  tested.  The  concentration  is  on 
restricting  the  number  of  subproblems  examined  (which  is  exponential 
in  n)  rather  than  the  amount  of  work  spent  on  each  subproblem 
(which  is  a  low  order  polynomial  in  n). 


III.  NODE  PARTITIONING 


The  first  exact  technique  built  around  the  above  relaxation  is 
one  which  generates  subproblems  by  restricting  the  subtrees  a  node 
is  allowed  into.  The  procedure  is  described  below.  We  begin  by 
describing  the  initialization  procedure. 

Step  0:  (Initialize) 

* 

0.1)  Find  an  upper  bound,  z  ,  using  a  heuristic  to  generate 
a  good,  feasible  solution. 

0.2)  Find  an  MST,  T,  and  identify  arcs  (vn,v.  ),  (vn,v.  ), 

U  1-1  U  1 


0.3) 


0.4) 


0.5) 

0.6) 

0.7) 

0.8) 


Reorder  the  nodes  so  that  v.  ,v.  ,...v.  are  now  v. , 

*1  '2  *k  1 

vo . vu- 


For  1  f  n,  set  R-. 


r  TRUE  i  *  j 
l  FALSE  i  =  j 


[R^  is  a  logical  variable  which  is  set  to  TRUE  if  v..  has 
been  removed  from  consideration  in  this  subproblem. 


Observations  made  above  allow  us  to  remove  some  v.j 
immendiately  ] . 


For  k  <  i  <  n,  set  R..  = 


( 


TRUE  i  >  j 
FALSE  i  <  j 


For  i  <  j  <  n,  find  an  MST,  T. ,  on  V.  if  vq,  where 
Vj  =  WjIRjj  =  FALSE}  i.e.,  \L  is  the  set  of  nodes 
permitted  in  subtree  j. 

Set  w.j  =  djj,  where  dj.  is  the  distance  from  Vj  to  its 
predecessor  in  T.. 


For  j  =  1,2,...,  k,  exchange  weight  between  w.^  for 


different  values  of  i  so  that  the  resulting  modified 


weights,  w^  ,  satisfy: 
wij  -  wi0 

where  w.q  =  din  in  the  unconstrained  MST,  T,  gener- 


iP 


ated  in  Step  0.2  above. 

0.9)  For  j  =  k+1,...,  n,  exchange  weight  between  w^  so 
that  the  resulting  w.t  satisfy 

wi’i  >  wi'j  - 1 

The  justification  for  all  of  these  steps  was  given  in  Section  2. 

Steps  0.8  and  0.9  guarantee  that  the  individual  w.’.  will  all  be  at 
least  as  great  as  the  weights  assigned  using  unconstrained  MST. 
Hence,  this  is  a  realization  of  the  statement  that  the  lower  bound 
obtained  using  this  procedure  must  be  at  least  as  great  as  the  lower 
bound  obtained  using  an  MST.  This  also  holds  true  for  subproblems. 

Thus,  we  have  initialized  a  subproblem  with  nodes  1,2 .  k  torced 

into  subtrees  1,2,...,  k,  respectively,  since,  for  i  <  k,  R^  =  TRUE 
for  i  f  j.  In  the  course  of  the  depth  first  search,  we  keep  track  of 
the  following  variables: 

d  =  The  depth  of  the  search,  i.e.,  the  number  of  nodes 
which  have  been  forced,  d  is  initialized  to  k. 


MIN 


IW. 


The  minimum  allowable  depth,  d^j^  is  initialized  to  k 
since  at  least  k  nodes  should  will  be  forced. 

The  subtree  which  the  d1"*1  node  is  forced  into. 


The  depth  first  search  proceeds  by  forcing  node  k+1  into  sub¬ 
tree  1;  i.e.,  d  is  set  to  k+1  and  IW^  is  set  to  1.  It  continues 
either  by  increasing  d  (to  force  another  node)  changing  IW^  (to  force 
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a  node  into  a  different  subtree)  or  decreasing  d  (to  release  a  node 
after  forcing  it  successively  through  all  subtrees).  The  depth  first 
search  procedure  follows. 

Depth  First  Search  Procedure 

Step  0:  Initialize  problem  (Steps  0.1  through  0.9  above). 

Set  d  =  k+1 

Set  djYjjjvj  -  k+1  Set  IWd  -  1 

Step  1:  Solve  the  currently  defined  subproblem;  i.e.,  find  a 
lower  bound  Zp,  and  a  feasible  solution  Zp. 

Step  2:  If  the  current  subproblem  fathoms;  i.e.,  Zp  >  Zp,  go 
to  Step  3;  otherwise  go  to  Step  5. 

Step  3:  Set  IW^  =  IW^  +  1 

If  IWj  >  NMAXd  go  to  Step  4;  otherwise  set  up  a  new 
subproblem  and  go  to  Step  1.  NMAX^  is  the  highest 
indexed  subtree  which  the  node  at  depth  d  may  be 
forced  into.  In  section  3  we  observed  that  one  should 
not  skip  over  subtrees.  Thus: 

NMAX^  =  max  (K,  max  [ IW. ] )  K  <  i  <  d 
Step  4:  Set  d  =  d+1 

If  d  <  dMIN  stop;  otherwise  set  up  a  new  subproblem 
and  go  to  Step  1. 

Step  5:  Set  d  =  d+1 
Set  IWd  =  1 

Set  up  new  subproblem  and  go  to  Step  1. 

To  set  up  a  new  subproblem,  one  need  only  modify  the  values  of 
a  few  Rjj  to  impose  (or  remove)  the  restriction  implied  by  the  alter¬ 
nation  of  d  and  IW^.  Thus,  after  d  is  set  to  d+1  and  IW^  is  set  to  1: 
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Rdi  = 


FALSE  j  =  1 
TRUE  j  >  1 


After  IWd  is  set  to  IWd  +  1 
Rd,IWd  =  false 


R  .  TU>  -1  =  TRUE 
d'IVvd 

After  d  is  set  to  d  -  1: 

r  FALSE  j  <  NMAXd 
V.  TRUE  j  >  NMAXd 


R 


d+lj 


The  search  space  can  be  further  pared  using  the  corollary  to 
Theorem  1.  If,  in  any  subproblem,  one  finds  that  two  nodes,  v.  and 
Vj,  both  forced  into  the  same  subtree  appear  in  separate  subtrees  in 
the  MST  formed' on  the  set  of  permissable  nodes  in  that  subtree,  then 
the  subproblem  may  be  discarded. 

As  was  mentioned,  this  optimal  technique  based  on  generating  a 
partition  lends  itself  simply  to  obtaining  a  feasible  solution  to  each 
subproblem.  The  partition  generated  at  each  step  is  feasible.  One 
need  only  generate  MST's  on  each  group  of  nodes  to  obtain  a  feasible 
solution.  A  simple  acceleration  technique,  which  proved  to  be  quite 
effective  in  practice,  was  to  reorder  the  nodes  v^+1,...,  vn  by 
distance  from  Vq,  nearest  first.  This  tended  to  increase  the  lower 
bound  most  rapidly.  Such  nodes,  when  restricted  to  a  single  sub¬ 
tree,  were  absent  from  all  others,  and  the  "deprived"  subtrees  were 
often  forced  to  connect  to  v^  over  longer  arcs. 

The  Rj.  are  used  in  the  optimal  partitioning  procedure  in  a 
straight  forward  fashion;  any  element  v..,  with  its  corresponding  R ■  • 
=  TRUE,  is  considered  to  be  removed  from  the  problem. 
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The  weight  exchange  procedure  described  as  part  of  the  initiali¬ 
zation,  which  guarantees  that  the  weight  on  each  node  will  be  at  least 
as  great  as  its  contribution  to  the  length  of  an  MST,  is  used  here  as 
well.  At  each  level,  K,  in  the  decision  tree,  we  save  the  values  of 
the  w^  in  a  variable  referred  to  as  wV1..  We  then  demand  that  w.'.  = 
w^j  1  *  e  «  we  exchange  weights  to  enforce  the  restriction.  The 

justification  for  doing  so  is  identical  to  that  used  in  the  initialization 
procedure.  Note  that  this  exchange  guarantees  not  only  that  the 
lower  bound  will  remain  tighter  than  an  MST,  but  also  that  the  lower 
bound  will  be  monotone  with  the  depth  in  the  decision  tree.  Neither 
of  these  things  is  true  without  the  exchange. 


\0 


Another  method  of  applying  this  relaxation  technique  to  the 
solution  to  the  CMST  problem  is  to  restrict  arcs ;  i .  e . ,  to  force  arcs 
to  be  either  "prohibited"  or  "required"  as  was  done  by  Chandy  and 
Russell  and  Elias  Ferguson.  Thus,  the  initialization  and  subproblem 
solution  are  essentially  the  same  as  they  were  in  the  technique  des¬ 
cribed  in  the  previous  chapter,  but  the  method  of  generating  subpro¬ 
blems  is  different.  The  solution  order  is  still  a  depth  first  search. 

Some  differences  exist  in  the  initialization  procedure.  Instead  of 
forcing  a  node  into  a  subtree,  we  simply  "require"  the  arcs  (i^,0), 
...,  (iK,0)  which  are  part  of  the  unconstrained  MST.  This  is,  of 
course,  equivalent  to  what  was  done  in  the  previous  case.  It  is 
implemented  in  a  slightly  different  way,  however. 

The  entire  procedure,  both  during  initialization  and  during 
subsequent  subproblem  generation,  restricts  itself  to  dealing  with 
established  arcs ,  i .  e . ,  arcs  which  connect  a  node  directly  to  Vq  or  to 
other  nodes  connected  to  Vq  by  established  arcs.  Thus,  each 
"required"  arc  forces  a  node  into  a  given  subtree  and  each 
"prohibited"  arc  forces  a  node  out  of  a  given  subtree.  As  a  new 

subtree  is  encountered  (i.e _ ,  when  an  arc  of  the  form  (vQ,  Vj)  is 

made  "required"),  we  simply  assign  the  next  available  subtree  number 
to  the  subtree. 

Subproblems  are  generated  by  successively  restricting  (requiring 
or  prohibiting)  established  arcs.  We  again  use  d  to  represent  the 
depth  of  the  search.  Here,  however,  d  refers  to  the  number  of 
forced  arcs  rather  than  the  number  of  forced  nodes.  Note  that  while 


the  number  of  required  arcs  is  limited  to  n,  the  number  of  prohibited 
arcs  is  not.  We  thus  have  a  different  type  of  decision  tree  than  we 
did  in  the  previous  section. 

When  forcing  nodes  into  subtrees,  we  dealt  with  a  tree  of  depth 
n  but  with  nodes  of  degree  sometimes  as  great  as  n.  Here,  we  deal 
with  a  binary  tree  of  depth  as  great  as  C^).  It  is  not  clear,  how¬ 
ever,  especially  with  the  paring  techniques  being  used,  which  deci¬ 
sion  tree  is  actually  larger. 

The  arc  chosen  for  inclusion  is, in  each  case  the  next  arc  to  be 
brought  in  by  Prim's  MST  Algorithm  (25);  i.e.,  the  shortest  arc 
connecting  a  node  to  some  node  connected  to  vQ  by  required  arcs. 
This  has  several  advantages: 

1.  The  arc  chosen,  if  excluded,  will  tend  to  raise  the  lower 
bound.  This  is  important  as  it  helps  control  the  si2e  of  the 
decision  tree.  Since  a  potentially  large  number  of  succes¬ 
sive  arc  exclusions  is  possible,  it  is  important  that  an  arc 
exclusion  result  in  an  increase  of  the  lower  bound  as  often 
as  possible  so  that  the  fathoming  process  will  limit  the 
depth  of  the  search. 

2.  If  the  arc  is  of  the  form  (v^,  v.),  it  need  only  be  consid¬ 
ered  as  "required"  and  not  "prohibited."  This  is  a  direct 
consequence  of  Theorem  1. 

3.  If  the  arc  (v.,  v.)  is  prohibited,  and  hence,  v^  is  excluded 
from  the  subtree  containing  v.,  then  so  are  all  arcs  of  the 
form  (v. ,  v„),  where  vv  is  forced  into  the  same  subtree  as 
v..  This  is  a  direct  consequence  of  an  Elias  and  Ferguson 


V.  COMPUTATIONAL  EXPERIENCE 

The  procedures  described  in  the  previous  two  sections  were 
coded  in  FORTRAN  and  run  a  PDP-10.  In  this  secton,  we  discuss 
the  results  of  experiments  run  to  test  the  behavior  of  their  run  time 
and  effectiveness  as  a  function  of  problem  size  and  constraint  tight¬ 
ness. 

Problems  were  generated  by  reading  in  n  and  m  and  generating 
random  X  and  Y  coordinates  for  the  nodes  within  a  unit  square.  The 
location  of  the  center  was,  in  various  problems,  either  random, 
centered,  or  in  the  corner.  Euclidean  distances  were  used.  Most 
experiments  were  run  with  the  center  at  the  geographic  center  of  the 
unit  square;  in  this  way,  larger  problems  could  be  examined. 

Several  series  of  problems  were  run  with  identical  values  of  n 
and  m  (and,  of  course,  different  randomly  generated  points)  to  see 
how  stable  the  running  time  is  from  one  problem  to  another.  The 
standard  deviation  was  found  to  be  close  to  the  mean  for  the  problems 
run.  This  essentially  says  that  we  should  not  pay  close  attention  to 
exact  run  times  or  the  exact  number  of  subproblems  examined. 

Series  of  problems  were  run  varying  n  and  m  and  using  both  the 
node  restricting  and  arc  restricting  procedures.  Both  procedures 
were  run  with  the  identical  problems  and,  furthermore,  the  same  set 
of  nodes  was  used  (with  new  nodes  added  as  the  problem  size  grew) 
for  all  problems  in  this  series.  This  results  of  this  experiment  are 
shown  in  Table  1.  As  can  be  seen,  the  running  times  for  both 
procedures  were  comparable  and  run  time  grows  exponentially  with 
problem  size.  (It  was  gratifying  to  find  that  the  optimal  solution 
values  found  by  both  procedures  always  matched!) 


It  has  already  been  mentioned  that  these  procedures  yield  lower 
bounds  which  are  at  least  as  great  as  those  obtainable  using  uncon¬ 
strained  MST's.  This  was  verified  empirically  by  actually  generating 
lower  bounds  with  MST's  as  the  algorithm  proceeded.  The  program 
was  to  print  any  exceptions,  i.e.,  any  times  where  the  MST  gener¬ 
ated  a  higher  lower  bound;  none  occurred.  Figure  3  shows  some 
typical  lower  bound  values  obtained  using  partitioning  and  MST's.  As 
can  be  seen,  not  only  are  the  partitioning  lower  bounds  greater,  but 
they  grow  more  quickly  with  depth.  This  is  significant,  as  a  linear 
increase  in  lower  bound  value  will  reduce  the  run  time  exponentially. 

To  measure  the  impact  of  the  difference  in  lower  bounds  between 
the  MST  and  partitioning  methods,  several  problems  were  run  first 
with  the  partitioning  method  and  then  with  the  MST  method  of  lower 
bounding.  The  results  of  this  experiment  are  shown  in  Table  2.  As 
can  be  seen,  the  partitioning  algorithm  examines  a  much  smaller 
number  of  subproblems,  and  apparently,  its  effectiveness  increases  as 
the  problems  grow  larger.  Thus,  although  it  is  somewhat  more  diffi¬ 
cult  to  evaluate  the  lower  bound  using  the  partitioning  algorithm  than 
it  is  using  an  MST,  it  is  less  than  n  times  as  hard  to  do  so.  The 
reduction  in  the  number  of  subproblems  which  must  be  examined 
appears  to  be  sufficiently  great  to  warrant  the  use  of  the  partitioning 
technique.  Indeed,  as  the  problem  size  grows,  its  attractiveness 


seems  to  increase. 
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VI.  SUMMARY  AND  CONCLUSIONS 

The  purpose  of  this  study  was  to  develop  improved  exact  tech¬ 
niques  for  the  solution  of  the  CMST  problem  (a  model  of  the  multidrop 
line  problem)  so  that  known  heuristics  for  the  solution  to  that  prob¬ 
lem  could  be  examined  on  a  broader  class  of  problems  and  so  that  new 
heuristics  could  be  developed  on  the  basis  of  what  was  learned. 
Much  of  this  happened.  An  improved  exact  technique,  based  upon 
generating  lower  bounds  using  partitioning  instead  of  MST's,  was 
developed  and  computational  experiments  were  run  using  it.  The 
bounds  yielded  by  these  techniques  were  tighter  than  those  yields  by 
the  MST  based  techniques,  and  hence,  the  number  of  subproblems 
which  had  to  be  examined  in  order  to  obtain  a  solution  was  smaller. 
Indeed,  the  decrease  in  the  number  of  subproblems  examined  more 
than  compensated  for  the  increased  effort  required  for  the  examina¬ 
tion  of  each  subproblem.  Thus,  the  new  techniques  served  their 
purpose  in  that  they  permitted  the  examination  of  problems  not  care¬ 
fully  examined  before.  In  particular,  it  was  possible  to  examine 
problems  with  very  tight  constraints,  although  it  was  not  possible  to 
examine  problems  of  substantially  greater  size  than  had  been  pre¬ 
viously  examined. 

Even  with  the  improved  technique,  the  growth  of  run  time  with 
respect  to  problem  size  was  found  to  be  exponential,  albeit  of  a  lower 
order  than  previously  know  exact  techniques  and  of  a  much  lower 
order  than  the  solution  space.  Thus,  one  cannot  use  the  technique 
for  large  problems.  A  number  of  acceleration  techniques  were  devel¬ 
oped  and  incorporated  into  the  procedure. 
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Thus,  it  was  possible  to  examine  sufficiently  interesting  problems 
using  the  exact  technique  to  make  several  insights  into  the  problem. 
The  first  is  that  the  performance  of  the  known  heuristic  degrades  as 
the  constraint  tightness  increases  and  improves  as  the  problem  size 
increases.  An  imporved  heuristic  [33]  was  also  developed  on  the 
basis  of  this  study. 
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Table  1:  Number  of  Subproblems  Examined 


n 

m 

Node  Restricting 

Arc  Restricting 

8 

2 

16 

34 

8 

3 

13 

24 

10 

2 

120 

199 

10 

3 

57 

68 

10 

4 

67 

73 

12 

2 

298 

696 

12 

3 

375 

362 

12 

4 

171 

138 

14 

2 

766 

723 

14 

4 

526 

379 

16 

3 

not  run 

1085 

16 

4 

818 

737 

18 

3 

not  run 

6832 

TABLE  2:  COMPARISON  OF  NUMBER  OF  SUBPROBLF.MS  EXAMINED 
USING  MST  AND  PARTITIONING  AS  LOWER  BOUNDS' 
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m 


NUMBER  OF 
SUBPROBLEMS 
(MST) 


NUMBER  OF 
SUBPROBLEMS 
(PARTITIONING) 
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FIGURE  3:  COMPARISON  OF  LOWER  BOUND  VALUES  USING  MST  AND 

INTERSECTION 
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I-  Introduction 

There  are  many  network  design  p rob  lens  which 
would  be  easily  solvable  except  for  the  presence 
of  a  fixed  charge  in  the  cost  of  each  facility. 
The  presence  of  the  fixed  charge  however,  which 
cannot  be  ignored  without  destroying  the  validity 
of  the  nodel.  Bakes  these  probiens  euch  sore 
difficult  to  solve.  We  begin  by  presenting 
several  such  probless  sad  a  general  technique 
which  say  be  of  use  in  obtaining  solutions  to  all 
of  then.  We  then  focus  on  one  such  probies  and 
actually  apply  the  technique  to  its  solution. 

II.  Ptoblen  5 to tenants 

The  nost  basic  problea,  and  the  one  which  we 
will  ultimately  focus  on.  is  that  of  locating 
earth  stations  in  a  satellite  network.  We  are 
given  a  fixed  set  of  X  terminal  locations  and  « 
set  of  il  potential  earth  station  location  sites. 
Terminals  communicate  with  one  another  via  the 
satellite  sad  they  reach  the  satellite  via  direct 
terrestial  connection  to  an  earth  station  (see 
Figure  l).  The  cost  of  connecting  terminal  i  to 
earth  station  j  is  given  by  c, . .  The  cost  of  an 

»  h 

earth  station  at  the  j  potential  site  is  de¬ 
noted  by  dj .  The  cost  of  a  satellite  link'  (cr  of 

the  satellite  itself)  is  is  assumed  to  be  dis¬ 
tance  independent  and  linearly  related  to  its 
capacity.  Thus,  the  cost  of  the  satellite  links 
is  aot  affected  by  the  selection  of  earth  sta¬ 
tions  tod  esn  tnerefo-e  o-  ijnorec.  We  assume 
initially  that  tnere  are  no  capacity  constraints 
on  the  earth  stations. 

The  problem  described  above  is  a  well- 
known  uncapacitated  facility  location  problea 
(UTLP) : 

N  M 

Minimize  Z  *  Z  Z  C.  X.  .  ♦  ZdJ. 

i*l  j*i  j  -  J 
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§  .  garth  Station 

S)  •  Satellite 

Terrestrial  Unk 
—B*  •  Satellite  Unk 

Figure  l.  NETWORK  MODEL 

subject  to: 

il.,  »  1  far  all  i 
j 

<  Yj  for  all  i  and  j 
Yj  4(0,  :)  for  all  i  sad  j 

where  corresponds  to  the  fraction  jf 

tarminsl  i  assigned  to  facility  ;  icv  v;  vs 
1  if  facility  j  is  induced  in  the  solution. 


This  problem  da*  been  sued  studied  [l-lOj.  The 
book  by  Haadler  tad  Mircdaodam  (5]  is  a  compen¬ 
dium  of  related  problem  and  solutions.  The 
procedure  of  trlenkotter  (1)  das  been  shown  to  be 
successful  in  obtaining  optimal  solutions  to  aany 
significant  uncapacitated  facility  location 
problems.  The  procedure  we  will  describe  in  the 
following  section  is  equivalent  to  Srlenkotter' s 
procedure  in  the  case  of  the  uncapacitated  facil¬ 
ity  location  problem  but  differs  from  his  in  that 
it  is  based  directly  m  combinatorics  (rather 
than  in  a  linear  prograoeing  dual)  and  is  direc¬ 
tly  extensible  to  many  other  problems  described 
below,  among  them  the  capacitated  facility  loca¬ 
tion  problem. 

We  note  that  the  UFL?  reduces  to  a  trivial 
problem  where  each  terminal  is  connected  to  its 
nearest  (least-cost  connection)  earth  station  if 
we  Ignore  the  fixed  cost  of  the  earth  stations. 
This  is  the  property  which  unifies  all  the  prob¬ 
lems  we  will  examine. 

There  are  many  extensions  to  the  UFL?  which 
are  also  part  of  this  class  of  problems.  If  a 
capacity  constraint  is  imposed  on  the  earth 
stations,  the  problem  reduces  to  an  assignment 
problem  if  the  cost  of  the  earth  stations  is 
ignored  and  we  assume  either  that  the  capacity 
constraint  is  on  the  number  of  terminals  or  we 
allow  a  terminal  to  'oe  associated  with  more  than 
one  earth  station  and  the  cost  of  connection  is 
linear  with  traffic. 


lowing  transformation.  A  bogus  potential  eartb 
station  location,  is  introduced  with  d^^O 

and  C.  _ ,=V.  for  iml,...N.  The  corresponding 

i  i 

capacitated  or  uncapacitated  problem  is  then 
solved  to  find  the  optimal  solution  including  all 
terminals.  The  bogus  earth  station  and  all 
terminals  associated  with  ic  are  then  removed 
from  the  optimal  solution  to  yield  an  optimal 
solution  to  the  problem  with  a  choice  of  which 
terminals  to  Include  (4).  Thus,  we  will  limit 
considerations  to  the  fixed  terminal  problems  in 
the  sequel. 

Another,  closely  related,  problem  is  that  of 
locating  repeaters  in  a  radio  based  network. 
Here  we  assume  we  are  given  a  set  of  X  terminal 
locations  and  M  potential  repeater  locations. 
Terminals  communicate  via  radio  links  to  the 
repeaters.  The  repeaters  coemunicate  directly 
with  one  another,  again  via  radio  links,  relaying 
messages  between  terminals.  A  terminal  can  or 
cannot  communicate  with  a  repeater  depending  on 
the  distance  between  them  and  the  nature  of  the 
intervening  terrain.  Thus,  we  have  a  UFL?  with 
all  c^efO,  »}.  As  we  will  tee,  the  nature  of 

the  cost  matrix  makes  this  problem  different  from 
the  previously  stated  UFLP's  in  several  important 
ways  which  will  alter  the  effectiveness  of  the 
proposed  solution  techniques. 

II.  Basic  Procedure 


Another  problem  is  that  of  designing  a 
network  as  above  where  not  all  terminals  need  be 
included.  In  this  version  of  the  problem,  there 
is  a  value,  V^,  associated  with  including  termi¬ 
nal  i  in  the  network  and  we  seek  to  maximise  the 
profit,  P,  where  P  is  given  by 


subject  to 


H 

IX..  *  W.  for  all  i 
J»l  1 


X.  <  T.  for  all  i  and  j 

ij  -  i 


Vi*  now  present  a  general  procedure  for  the 
solution  of  all  of  the  above  problems.  For  the 
sake  of  clarity,  we  describe  application  of  the 
procedure  to  the  solution  of  the  UFL?.  Similar 
procedures  have  been  proposed  by  Held  &  Karp  [b], 
and  Camerini  [lj  for  the  solution  of  a  broad 
class  of  problems.  Those  procedures  have  been 
shown  to  be  effective  in  some  cases  and  ineffec¬ 
tive  la  others.  One  of  our  objectives  is  to  shed 
light  on  Inherent  differences  among  problems 
which  differentiate  them  in  terms  of  the  effec¬ 
tiveness  of  these  procedures  and  others  like 
then. 

Let  where  P„  is  a  penalty  added 
to  the  cost,  C^j ,  of  connecting  terminal  i  to 
tarth  station  j.  If  P ^  >  0  for  all  i  and  j  and 


X  «it  Tj  e{0,  1}  for  all  1  and  j 

A  capacitated  version  of  this  problem  exists 
by  adding  the  constraints 

I  Xl}  tt  <  S.  for  all  j 
where  t.  is  the  traffic  associated  with 

i 

terminal  i  and  is  the  capacity  of  earth  sta¬ 
tion  j . 

This  problem  can  be  reduced  to  the  preceding 
one,  with  fixed  terminal  locations,  by  the  fol- 


then  the  solution  to  the  UFLP  using  C.  in  place 

*  J 

of  and  dj*0  is  <  relaxation  of  the  original 
UFL?  for  any  P^  satisfying  the  above  con¬ 
straints.  Here  tbe  facility  costs  are  distrib¬ 
uted  over  the  terminals.  The  solution  to  tbe 
problem  with  d  *0  is  trivially  obtained  by  se¬ 
lecting  the  minimum  in  each  row.  We  note 

that  the  solution  value  thus  obtained  is  indeed  a 
lower  bound  to  a  feasible  solution  since 


(C. .  -  C  )  <  d 

ij  IJ  -  J 


where  I  11  the  set  of  terminals  associatad  with 
J 

earth  station  j  ia  the  solution  to  the  relaxed 
problea. 

Ideally,  we  would  choose  the  P  to  maximize 

the  value  of  the  relaxed  optiaua  and  thus  provide 
as  tight  a  bound  as  possible,  i.e.,  solve: 

Maximize  over  P. . 

ij 

IZ  *  Z  min  (C  ♦  P  )] 
i  j  J  J 

this  problea  has  been  successfully  approached 
using  subgradient  optimization  techniques  [6,10]. 
For  the  aoaent,  however,  we  defer  further  discus¬ 
sion  of  how  best  to  set  the  P^  and  note  that 

relatively  siaple  heuristics  often  suffice  in 
many  cases  while  in  other  cases  even  the  optimal 
P^  (i.e.  those  aaxiaizing  the  lower  bound)  leave 

a  lower  bound  significantly  lower  than  the  opti- 
aal  solution  to  the  unrelaxed  problea. 

For  the  specific  case  of  the  ITLP,  a  linear 
programming  problea  can  be  solved  to  find  the 

optimal  P.  . 

tj 

Maximize  2  »  I  V. 


Ctj  ♦  Pij  >  Vt  for  all  i  and  j 

Z  P. .  *  d  for  all  j 
<  J 


This  is  the  basis  for  Erlenkotter' a  procedure 
but,  again,  he  points  out  that  often  a  siaple 
heuristic  suffices  to  set  the  P„  . 

The  solution  to  the  relaxed  problea  and  the 
values  of  (slack)  variables 


S  *  Z  MAX  (0,  V  -  C. .) 
J  ,  J  ij' 


can  be  used  as  the  basis  for  a  branch  and  bound 
procedure  to  find  the  optiaua  UFLP  solution. 
Specifically,  we  begin  by  setting  the  P^  and 

solving  the  relaxed  problea.  We  then  obtain  a 
feasible  solution  to  the  original  problea  by 
selecting  the  earth  stations,  j,  for  which  S^«0. 

These  correspond  to  earth  stationa  which  justify 
their  cost  by  virtue  of  the  savings  obtained  for 
terminals  associated  with  thea.  To  see  this, 
aote  that,  during  the  procedure ,.  we  try  to  aake 
(V  (or  equivalently,  the  ainiaua  in  row  i)  as 

large  as  possible  in  order  to  aake  the  lower 


bound  as  large  (tight)  as  possible.  Thus, 
auat  equal  0  for  some  i  with  C  <  V  ;  or  else  we 

ij  -  j 

would  aake  IT  larger. 

Let  J*»{j|S  *0j.  If.  for  all  i,  C.  >  V 
*  J  •  “  J 

for  all  jeJ  except  for  a  single  j  tt^  bound  is 
tight.  If  f°r  two  or  more  jtj  ,  there  is 

a  gap  between  the  lower  bound  and  the  feasible 
solution.  Erlenkotter  suggests  a  procedure  for 
adjuating  the  V  to  sometimes  eliainate  such 

situations.  If  a  gap  reaains,  one  such  j  is 
selected  and  alternately  forced  in  and  out  of  the 
solution.  The  procedure  is  continued  with  the 
newly  formed  subprobleas  in  place  of  the  problea 
which  spawned  thea.  A  subproblea  is  said  to 
fathoa,  and  bence  it  eliainated  froa  further 
consideration,  when  its  lower  bound  equals  or- 
exceeds  the  currently  best  feasible  solution. 

A  similar  procedure  can  be  defined  for  the 
capacitated  facility  location  problea.  Again  the 
fixed  coats,  d^ ,  are  replaced  by  penalties,  P^ , 

added  to  the  satisfying  P„  >  0  and 

Z^  j  <  dj .  An  assignaent  problea  is  then 

solved  to  obtain  a  lower  bound  on  the  original 
problea.  Heunttica  for  setting  the  P  and 
obtaining  a  feasible  solution  are  less  obvious 
here  and  very  little  work  has  previously  been 
done  on  these  probleae. 


Ill .  A  More  General  Procedure 

We  now  describe  the  procedure  la  a  more 
general  context.  Define  a  aatroid,  M,  to  be  a 
2-tuple  (E,F)  where  E  is  a  finite  set  of  eleaents 
{*j  I  j*l, " '!•!  *md  F  is  a  faaily  of  independent 

subsets  of  E.  Our  notion  of  independence  is  very 
general.  We  require  only  that  independent  sets, 
I,  satisfy: 

1.  If  I  c  F  and  I*  c  I  then  I'  l  F;  i.e.,  all 

subsets  of  an  independent  set  are  inde¬ 
pendent. 


2.  Given  two  independent  sets,  E^  iad 

containing  p  and  p  *  1  elements  respec¬ 

tively,  there  exists  some  e  cl  .  ,  e  Cl 

j  ?-i  j  p 

such  that  I  {«  )ef.  That  is,  we  can 
always  find  *an  element  in  Che  larger  mde- 
pendent  sec  (noc  already  m  the  saaller  one) 
which  can  be  added  Co  Che  saaller  indepen¬ 
dent  sec  without  destroying  independence. 

The  book  by  Welsh  [9]  is  an  excellent  treat- 
sent  of  aatroid  theory  and  the  boos  by  Lawler  (7] 
gives  applications  of  aatroid  theory  to  the 
solution  of  coabioatorial  optimization  prooleas 
including  ouch  ot  the  discussion  in  this  section, 
f Hserini  [1]  gives  ouch  of  the  rest. 

There  are  aeny  types  of  aatroids.  Three 
types  are  of  particular  interest  to  us: 

1.  Metric  tlatroids  -  Let  E  be  the  set  of  col- 
uams  of  a  matrix  and  F  the  faaily  of  sets  of 
columns  which  are  linearly  in  dependent. 

2.  Graphic  Hatroids  -  Let  E  be  the  set  of  edges 
of  an  undirected  graph  and  F  be  the  faaily 
of  edge  secs  that  contain  no  cycles  (i.e., 
acaberi  of  F  are  trees  and  forests). 

3.  Partition  Hatroids  -  Let  E  be  an  arbitrary 
set  sod  B^,  i  *  l,'"K  be  a  partition  of  E; 

i.e.,  U  B.  *  E  and  B.  B.  *  «  for  i  P  j. 

i  1  1  J 

Let  F  be  the  faaily  of  subsets  of  C  con¬ 
taining  no  more  than  A.  elements  from  B., 
i.e.,  1 

I  t  F  iff  II  3il  <  At  for  all  i 

It  is  fairly  easy  to  see  these  three  notions 
of  independence  satisfy  the  two  required  proper¬ 
ties  and  thus  that  the  three  2-tuples  are  indeed 
aatroids. 

Hatroids  are  of  interest  in  the  context  of 
the  solution  of  coabioatorial  optimization  prob¬ 
lems  because  of  the  existence  of  very  efficient 
procedures  for  the  solution  of  problems  which  can 
be  shown  to  be  equivalent  to  finding  the  "best”' 
independent  set  in  a  aatroid. 

Specifically,  suppose  a  weight,  w^ ,  is 
associated  with  each  element  e.cE  for  some 

J  sir 

aatroid  (E,  F) .  We  nay  wish  to  find  the  set  I  cF 
satisfying 

W(I*)  a  HAX  (W(I)) 

Itf 


where  V(I)  a  I  w 

jel  J 

Without  the  structure  inposed  oy  the  aatroid 
properties,  the  solution  of  this  problem  could 
entail  exaaimng  all  ItF,  potentially  on  the 
v 

order  jf  2  subsets  if  |EI*N.  We  can  prove,  how¬ 


ever,  using  the  aatroid  properties  that  the 
algorithm  [7]  solves  the  proolea. 

Greedy  Algorithm 
■* 

Step  0:  I  -  0 

i  -  0 

Reindex  the  e.  so  that  w  >  w.  for 
j  <  k  J  J  “  * 

Step  1:  i  -  i  ♦  1  „ 

If  i  >  N,  stop;  I  is  the  optimal 
set 

Step  2:  If  I*  {e .  ]  t  F  set  I*  *■  I*  {e.) 

Return  to  Step  1  1 

Thus,  we  need  only  sort  the  eleswnts  of  E 
and  examine  each  cleaent  once,  testing  for  inde¬ 
pendence  with  the  previously  chosen  elements. 
Assuming  the  test  for  independence  is  not  too 
difficult  (and  it  is  quite  aiaqiLe  in  the  three 
cases  above) ,  the  overall  procedure  is  of  the 
order  of  a  polynomial  in  N.  Thus,  large  probleas 
can  be  solved. 

Furthermore,  even  if  w^  is  replaced  by  any 
aonotone  (increasing)  f(v^)  the  greedy  algorithm 

will  still  find  an  optimal  solution;  that  is,  the 
optimal  solution  found  using  w^  above  is  also 

optimal  for  any  aonotone  f(w^).  Thus,  f  need 
only  be  evaluated  for  the  e^  actually  chosen. 

Sometimes  we  wish  to  find  an  independent  set 
of  minimum  total  weight  subject  to  the  restric¬ 
tion  that  it  also  be  of  aaxiaua  cardinality.  An 
example  of  this  is  a  ainiaal  spanning  tree  (i.e. 
finding  a  tree  spanning  all  nodes  and  of  ainiaua 
total  length).  The  greedy  algorithm  can  be  used 
to  solve  this  problea  by  setting  w^  a  A  - 

where  the  e^  are  edges  of  the  given  graph,  is 
the  length  of  e^ ,  and  A  >  for  all  j. 

Unfortunately,  many  probleas  cannot  be 
phrased  as  finding  aaxiaua  weight  independent 
sets  in  a  aatroid  because  the  sets  satisfying  the 
constraints  do  not  conform  to  the  two  properties 
given  above.  One  may,  however,  be  able  to  phrase 
the  problea  as  the  intersection  of  two  or  sore 
aatroids  as  follows. 

The  intersection  of  two  aatroids,  Mj*(E,  Fj) 
and  Hj*(E,  r^),  defined  over  the  sane  set  of 

elements  but  with  two  different  nations  of  inde¬ 
pendence,  is  the  Z-tuple  (E.F)  where  leF  iff 
(IcF.  and  ItFj).  Unfortunately,  the  intersection 
of  two  aatroids  is  not  a  aatroid  and  thus  aaoy 
properties  and  algorithms  do  not  carry  over. 
There  is,  however,  an  efficient  algcntha  (e.f. 
...iwler)  for  finding  the  aaxiaua  weight  set  ItF 
for  an  intersection  of  two  aatroids. 


Cote  Matrix 


The  intersection  it  iC  satroids  can  similarly 
be  defined  is  o  2-tuple  iE,F)  where  ItF  iff  lef 

J 

for  all  There  is  ao  known  algorithm 
with  a  runnini  time  polynomial  in  the  number  of 
elements  in  £  to  find  the  aaxiaua  weight  ItF  for 
<C>2.  Indeed,  the  problea  for  ii  >  1  has  been 
shown  to  be  SP-conplete .  It  has  also  been  shown 
that  the  probleas  for  K>3  are  essentially  equiva¬ 
lent  to  probleas  for  .7*3,  i.e.,  it  is  only 
slifhtly  aore  difficult  to  solve  probleas  for 
general  K  than  it  is  to  solve  probleas  for  K«3. 

We  now  turn  to  the  detailed  solution  of  one 
of  the  ipecific  probleas  described  in  the  preced¬ 
ing  sections. 


The  Repeater  location  Problea 


The  techniques  described  in  the  preceding 
sections  is  applied  to  the  solution  of  the  re¬ 
peater  location  problea  in  a  broadcast  radio 
network.  In  this  problea,  we  are  given  a  set  of 
X  terminal  locations,  t^,  a  set  of  H  potential 

radio  repeaters,  r^,  and  a  coverage  matrix 
defined  by: 


1  if  ti  can  coaauoicace  with  r^ 
0  otherwise. 


One  could  thus  consider  solving  the  repeater 
location  (RLP)  problea  as  s  UFLP .  la  practice 
this  turns  out  to  be  a  had  idea  because  of  the 
syaaetry  of  the  RLP.  la  particular,  all  repeat¬ 
ers  have  identical  coats  aad  all  teraioals  con¬ 
nect  with  identical  coat  to  repeaters  which  cover 
thea.  An  algorithm  for  the  general  IJFL?  would 
have  to  wade  through  a  large  number  of  alternate 
optiau.  We  can  eliminate  a  large  part  of  this 
problem  and  in  fact  uae  the  symmetry  to  advantage 
by  applying  the  following  dominance  rules  (c.f. 
Garf inkle  and  Xeahauaer)  to  eliminate  potential 
aolutlona: 


Ut  -  U  I  c.j  *  1} 
aad  Tj  >  |l  I  c^  *  1} 

Thua ,  R^  is  the  set  of  repeaters  which  can  cover 

terminal  1  aad  T.  is  the  set  of  terminals  covered 
by  repeater  j .  J 

Rule  1:  If  Ri  R^  then  t^  dominates  t^  and  t^ 
cam  be  eliminated  from  the  problem. 


A  feasible  network  design  corresponds  to  a  selec¬ 
tion  of  repeaters  which  "cover”  all  the  terminals 
in  the  sense  that  tach  terminal  can  communicate 
with  at  laast  one  repeater.  (We  assume  the  re¬ 
peaters  can  always  communicste  with  one  another. ) 
Aa  optimal  solution  is  a  feasible  sot  of  repeat¬ 
ers  of  minisnsm  cardinality.  In  this  simple  form 
of  the  problem,  all  repeaters  are  assumed  to  have 
identical  cost  (say,  l)  aad  unlimited  capacity. 
Thua,  the  problem  is: 

n 

Minimize  Z  *  l  T, 

J«t  J 

X 

s.t.  I  C.  .  T,  >  l  for  i  *  l,'"X 
j«l  lJ  J  ~ 

aad  Tj  *  0  or  l  for  j  *  l,""M 

The  variables  T,  correspond  to  selecting  the 
repeater  in  potential  location  j  (T.*l)  or  not 
selecting  it  (7^=0).  This  is,  of  course,  s  set 
covering  problem  (c.f.  Garfiakle  and  Xemhauser). 

We  could  also  considered  this  problea  as  an 
uncapacitated  facility  location  problem  (UFL?) . 
In  this  latter  case,  the  coverage  matrix  corre¬ 
sponds  to  the  matrix  of  costs  of  connecting 
location  i  to  facility  j  m  the  UFLP  at  follows: 


Rule  2:  If  T.  T.  then  c.  nominates  r,  and  rk 
j  x  j  xx 

can  bo  eliminated  from  the  problea. 

'  It  is  easy  to  see  that  it  suffices  to  con¬ 
sider  only  undoainated  terminals  and  repeaters  in 
seeking  an  optimal  solution  to  the  RLP  since  any 
solution  containing  a  dominated  repeater  could 
have  included  an  undominated  repeater  instead  and 
any  solution  which  covers  an  undominated  terminal 
alao  covers  all  terminals  dominated  by  it. 

To  get  the  full  benefit  of  dominance  among 
tezminals  and  repeaters,  rules  1  and  2  should  be 
applied  repeatedly  until  no  further  reduction  in 
the  problea  can  be  effected,  because  it  may  be 
that  ^  dominates  r^  after  some  dominated  termi¬ 
nals  are  removed  from  consideration.  Further¬ 
more,  in  the  course  of  a  branch  and  bound  pro¬ 
cedure,  vhere  repeaters  are  forced  into  and  out 
of  the  solution  to  define  disjoint  subproblems , 
rules  1  and  2  may  be  successfully  applied  to  find 
further  dominance  in  subproblems. 

A  version  of  the  above  procedure  was  coded 
up  and  run  on  a  variety  of  problems.  The  results 
of  these  runs  are  sbovn  m  Tables  I  4  II.  As  can 
be  seen,  the  procedure  is  quite  effective  for 
aeny  problems  even  though  only  relatively  naive 
branching  and  bounding  heuristics  were  used,  in 
particular,  the  branching  rule  used  was  to  fiud 
the  repeater  which  covers  the  largest  number  of 
currently  uncovered  and  undoainated  terminals  aod 
alternatively  forcing  it  m  and  out  oc  the  solu¬ 
tion.  The  lower  bound,  2^,  was  generated  for 
each  subproblem  using  the  following  procedure. 


Step  0: 


2  =  auaoer  ot  repeaters  forced  into 

the  soLution  in  tins  subproblem 


0  if  C. 
ij 


if  Cij 


=  0 


For  each  repeater. 


r j ,  Let  Nj  *  the 

umber  of  uncovered  and  undominated 
terainals  covered  by  r^  .  Also,  Let 


V1- 


S,  is  the  remaining  weight  to  be 


aaaigned  in  coLuan  j  of  the  cost  aatrin 
Find  the  column  (repeater),  E,  with 


chat  N 


YY 

>  0. 


We  only  consider  J  such 


Let  b  «  Sj/Kj 


Step  3: 


For  i  *  1,  N 
[For  j  *  l,  M 


[If  Cu  a  X 


c. . 

ij 


{wiJ 

*J  *  *J 

VSJ 

] 


wiJ*b 


Step  4:  If  there  is  any  j  with  Nj>0,  return  to 
Step  2.  Otherwise  go  to  Step  S 

St«P  ?.;  *  *18  *  f  °V 

Mote  that  the  above  procedure  simply  assigns 
b  to  all  eleoenta  in  a  cow  at  each  step  and  this 
is  of  the  general  fora  described  above. 

Research  is  currently  in  progress  to  iden¬ 
tify  aore  effective  branching  and  bounding  rules. 
We  see  (see  Table  1)  that  for  probleas  with 
geographic  coverage,  i.e.  whan  the  values  of  C^ 

are  based  oo  the  distance  between  t^  and  r^,  the 

procedure  works  very  well.  This  is  because  the 
dominance  is  very  effective  in  these  cases.  For 
randoa  coverage,  however,  the  doainance  is  lets 
effective.  The  overall  procedure  is  nonetheless 
sufficiently  effective  to  solve  probleas  with  SO 
locations. 


TABLE  1  (Geographic  Coverage* 


NT 

HR 

0 

SOL'H 

COUNT 

30 

30 

.024 

18 

1 

30 

30 

.10 

7 

1 

30 

30 

.46 

2 

1 

50 

50 

.027 

24 

1 

50 

50 

.484 

3 

l 

100 

100 

.071 

18 

1 

100 

100 

.109 

11 

3 

150 

150 

.032 

36 

1 

150 

150 

.103 

11 

3 

150 

150 

.204 

6 

15 

300 

50 

.210 

7 

1 

300 

50 

.483 

4 

1 

NT 

* 

Hub  be  r 

of 

terainals 

NR 

* 

Huaber 

of 

repeaters 

0 

* 

Density,  | 

percentage 

of  I's  in 

coverage 

■atria 

SOL'  N 

* 

Huaber 

of 

repeaters 

m  optimal 

soLution 

COUNT  3  Hunter  of  subprobleas  esaained 


TABLE  tl  (Ranaon  Coverage) 


NT 

HR 

D 

SOL'H 

COUNT 

50 

20 

.10 

9 

l 

so 

20 

.20 

7 

31 

50. 

20 

.23 

6 

51 

so 

20 

.30 

5 

15 

50 

20 

.35 

4 

63 

50 

20 

.50 

3 

15 

so 

20 

.70 

3 

27 

50 

20 

.80 

2 

1 

so 

so 

.05 

17 

9 

so 

so 

.10 

n 

157 

so 

so 

.30 

5 

373 

so 

SO 

.50 

3 

41 
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ABSTRACT 

An  algorithm  is  described  for  designing  cir¬ 
cuit  switched  networks.  In  contrast  to  previous 
work,  the  algorithm  considers  both  voice  and  data 
traffic.  The  necwork  design  is  cost  efficient. 

It  provides  topological  design  as  well  as  routing 
doctrine.  Performance  analysis  for  both  voice  and 
data  traffic  is  carried  out. 

INTRODUCTION 

Circuit  switching  is  a  technology  which  has 
long  been  employed  in  voice  conmunicat  ion  networks. 
In  such  networks,  prior  to  any  communication  tak¬ 
ing  place  between  two  users,  a  connecting  route  is 
established  between  them.  A  sequence  of  channels 
or  trunks  is  reserved  for  their  use.  The  route  is 
dedicated  to  their  communicat ion  for  its  duration. 
If  a  route  cannot  be  reserved,  due  to  traffic  con¬ 
gestion,  the  user  Initiating  the  request  for  con¬ 
nection  Is  "turned  away."  His  call  is  "lost." 
Circuit  switched  networks  are  thus  characterized 
by  an  overhead  delay  In  the  establishment  of  the 
connecting  route.  However,  they  are  also  charac¬ 
terized  by  a  low  delay  during  the  actual  period  of 
coonunlcat ion .  This  Is  due  to  the  absence  of  any 
contention  after  a  route  has  been  set-up.  These 
are  conditions  well-suited  to  voice  coosmmlcat ions. 
The  duration  of  the  typical  telephone  conversation 
la  normally  much  greater  than  the  time  for  route 
aet-up.  The  overhead  can  be  easily  tolerated. 

The  present  paper  provides  an  algorithm  for 
designing  circuit  switched  communication  networks. 
In  contrast  to  previous  work,  attention  is  direc¬ 
ted  to  combined  voice  +  data  traffic,  rather  than 
to  voice  traffic  alone.  There  Is  great  interest 
In  designing  networks  for  such  integrated  ccxmsunl- 
catlons.  Traffic  volvases  are  much  higher  than  in 
separate  networks.  Consequently,  savings  can  be 
achieved  through  economy  of  scale. 

In  most  Integrated  communications  voice  traf¬ 
fic  (at  least  at  the  outset)  is  much  higher  than 
data  traffic.  Circuit  switching  provides  a  reas¬ 
onable  connection  technologv.  This  is  the  net¬ 
working  alternative,  which  from  the  delav  point  of 
view,  la  moat  well-matched  to  the  dominant  traffic 
element.  If  data  traffic  grows  relative  to  voice 
traffic,  argunents  can  well  be  made  for  using  ot¬ 
her  methods  of  connection,  such  as  packet  switch¬ 
ing  or  hybrid  switching.  However,  in  a  combined 
environment  voice  traffic  will  always  be  signifi¬ 
cant.  Circuit  switched  networks,  with  their  mat¬ 


ure  switching  technology,  will  alwavs  be  attract  ive. 

The  algorithm  presented  is  unified.  It  pro¬ 
vides  a  circuit  switched  network  design  for  a  pre¬ 
specified  end-to-end  performance  criterion  --  the 
probability  that  a  call  is  lost.  The  design  spec¬ 
ifies  the  network  topology  --  the  connectivitv  of 
network  nodes  and  the  dimension  of  links  in  terms 
of  number  of  trunks.  A  routing  doctrine  is  speci¬ 
fied  with  both  primary  and  alternate  routes.  The 
average  cross-network  delay  performance  for  data 
traffic  is  provided.  Network  cost  is  given,  hav¬ 
ing  been  computed  from  both  link  and  node  require¬ 
ments.  It  should  be  emphasized  that  only  the  back¬ 
bone  network  is  considered.  Local  traffic  is  as¬ 
sented  to  be  concentrated  at  the  backbone  nodes  us¬ 
ing  standard  teleprocessing  techniques  such  as  mul¬ 
tiplexing  and  concentration.  The  authors  have  drawn 
from  a  wealth  of  previous  work  in  the  area  of  cir¬ 
cuit  switching.  Specific  acknowledgement  is  paid 
to  the  work  of  Katz  and  Knepley  C23  and  1 3 i  .  How¬ 
ever,  It  Is  believed  that  this  Is  the  first  algor¬ 
ithm  which  considers  both  voice  and  data  traffic 
carried  on  the  same  circuit  switched  network.  Fur¬ 
thermore,  the  voice  and  data  traffic  are  not  multi¬ 
plexed  on  the  same  channels,  (In  a  TASI  type  arran- 
gment),  rather  they  contend  for  available  link  cap¬ 
acity. 

DESCRIPTION  OF  THE  ALGORITHM 
High  Level  Description 

Before  proceeding  with  a  detailed  description 
of  the  design  algorithm  It  would  be  worthwhile  to 
take  a  look  at  a  high  level  block  diagram,  Illus¬ 
trated  In  Figure  1.  The  major  functional  segments 
of  the  design  procedure  are  shown  here.  The  PRE¬ 
LIMINARY  SECMENT  provides  the  necessary  design  in¬ 
puts,  e.g.  node  locations,  traffic,  desired  perfor¬ 
mance,  etc.  It  also  ''massages"  the  Inputs,  putting 
them  In  the  form  necessary  for  later  algorithmic 
procedures.  The  TOPOLOGICAL  DESIGN  SEGMENT  carries 
out  the  optimization  of  the  network  design.  It  pro¬ 
vides  a  cost  effective  connectivity  of  the  nodes 
for  the  desired  traffic.  It  also  dimensions  the 
network  links  to  meet  the  desired  performance  goal. 
The  ROUTING  SEGMENT  gives  the  doctrine  to  be  used 
on  this  topological  design.  It  defines  the  routes 
through  the  network  used  to  connect  desired  end- 
user  nodes.  The  block  labelled  NETWORK  SPECIFICA¬ 
TIONS  sis-anarlzes  the  network  design  up  to  the  point 
in  the  algorithm;  that  Is,  the  topology,  trunk  di¬ 
mensions  and  routing  doctrine.  The  VOICE  PERFOR¬ 
MANCE  ANALYSIS  SECMENT  analyzes  the  performance  of 
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the  network  stored  in  the  MFTWORK  SPFCIFICATION 
with  respect  t  voice  traffic.  It  estimates  the 
average  probab i  l  i  tv  of  a  lost  ca^l  for  the  given 
input  traffic.  This  is  the  probability  that  a 
voice  communicator  desiring  connection  to  a  remote 
node  Is  turned  jwav.  Nothing  is  done  it  the  aver¬ 
age  probabilitv  of  a  lost  call  is  within  a  given 
tolerance  of  the  goal  in  the  PRELIMINARY  SF.OIF.NT. 
Otherwise,  instructed  adjustments  are  fed  back  to 
the  NETWORK  SPECIFICATION.  The  topological  design 
Is  not  changed.  However,  the  link  dimensions  in 
terms  of  trunks  are  changed  to  bring  the  measured 
average  probability  of  a  lost  call  close  to  the 
desired  goal.  The  DATA  PERFORMANCE  ANALYSIS  SEG¬ 
MENT  measures  the  average  cross  network  delay  for 
data  traffic  carried  in  the  resulting  network  sto¬ 
red  in  the  NETWORK  SPECIFICATION  block.  The  de¬ 
sired  goal  for  delay  does  not  impact  network  de¬ 
sign.  This  block  is  merely  a  post -processor .  The 
COSTING  SEGMENT  block  computes  the  transmission 
and  switch  cost  for  the  resulting  network.  Finally 
the  OUTPUTS  block  summarizes  the  circuit  switched 
network  design. 


Fig.  I  High  Level  Block  Diagram  of  Algorithm 


Detailed  Description 

A  detailed  description  of  each  portion  of  the 
algorithm  structured  in  Fig.  1  will  now  be  given. 


Preliminary  Segment  Consists  of  three  separ¬ 
ate  blocks  shown  in  Fig.  2.  The  INPUT  BLOCK  stor¬ 
es  the  constraints  which  the  network  is  designed 
to  satisfy.  Here  are  the  backbone  nodes  and  their 
locations.  These  are  labelled  l , 2 , . . • l, . . . j NN. 
The  voice  traffic  is  stored  as  a  matrix  *nd 

the  data  traffic  as  a  matrix  £D|jl.  Voice  traffic 

units  are  Erlangs  and  data  traffic  units  are  bits 
per  second.  The  i-J  entries  of  each  of  these  ma¬ 
trices  constitute  the  traffic  originating  at  node 
"!'*  which  Is  destined  for  node  "j".  Voice  traffic 
Is  taken  to  be  symmetric.  A  party  initiating  a 
voice  call  must  automat ical ly  provide  an  answer- 
path.  An  Erlang  of  voice  traffic  originating  at 
node  "i"  with  destination  "j"  automatically  imp¬ 
lies  an  equal  flow  from  "j"  to  "i".  The  design 
algorithm  assumes  a  particular  "archi tecture"  for 
handling  data  traffic.  Parameters  associated  with 
this  architecture  are  provided  in  the  INPUT  BLOCK. 
Specifically,  two  different  tvpcs  of  data  messages 
are  allowed  to  be  handled  by  the  desired  circuit 
twitched  network, a  bulk  file  transfer  CTyp*  A)  and 
a  ahort  interactive  message  (Type  B) .  A  fixed 
fraction  of  the  data  traffic  pattern,  RHOA  Is  assu¬ 
med  to  correspond  to  Tvpe  A  traffic  and  a  fixed 
fraction,  RHOB  is  assumed  to  correspond  to  Tvpe  B 
traffic.  Type  A  file  transfer  messages  are  taken 
to  consist  cf  IA  bits  and  are  handled  In  the  tol- 


l  owing  wav  Upon  rh.*  snrt  >i  Tvpe  A  "ims  «*<• 
general  i  on  ,  a  conn***  t  in/  r  r  ■  r  h*‘  1  e s r  i  n  i *  ;  «> 

i  s  oht  a  i  ned  b\  i  i  a  i  -  u;- .  I  *  t  ..  r :»«  '  i-i.  r  <•:.■»!  i  • 

IS  cam  ed  nut  eve  TV  f  *t  •••■..  m  ’  s  -r.  t  :  !  1  v.  i.;i.  ■.  r  .  n 

is  obtained.  Tin*  Tv*-<  A  .!««-ssage  is  Su!i«*rre.:  ami 

fed  into  the  connecting  route  at  a  rate  >  r  R  hi  is 
per  second  The  r>  -ur  ■*  :  *.  v  I  !  jr-r  :  i  '  ‘  :  i 

single  message  transfer.  Ihe  c  f  i  .s  then 

broken.  !A  and  R  are  list'd  m  the  INPUT  BI.fX'K 
Typical  values  are  10  an!  9b0*'.  Tvpe  H  interac¬ 
tive  traffic  is  assumed  t  '  have  the  following  cba- 
rac r er  i  s t  i  c s .  An  interactive  message  is  LB  bits 
long.  A  stream  of  interactive  messages  generated 
at  one  node  with  another  as  the  destination  have 
time  separations  of  T  seconds.  T  is  the  "think" 
time  which  depends  upon  the  speed  and  attention  of 
the  terminal  user  generat 1  the  messages.  Upon 
the  start  of  a  Tvpe  R  message  generating  session 
(the  generation  of  the  first  message)  a  r -ut r  is 
dialed  up,  as  for  a  Tvpe  A  message.  However  ,  alter 
the  first  message  is  sent,  the  route  is  not  closed 
down.  Rather,  it  is  taken  to  be  held  open  until 
K-i  additional  Type  B  messages  are  sent  over  it. 
Messages  are  assumed  bufferred  and  ted  to  the  con¬ 
necting  route  at  a  rate  of  R  bits  per  second.  The 
parameters  LB,  T  and  K  are  provided  in  the  INPUT 
BLOCK.  Typical  values  are  LS-10G0  --  a  standard 
teletype  line,  T*10  sec.  and  K-200  or  l.  K-200 
corresponds  to  classical  circuit  switching  of  an 
interactive  data  stream.  Here,  after  dial-up  a 
terminal  user  holds  a  connecting  route  to  a  compu¬ 
ter  for  an  entire  computing  session.  Ka 1  corres¬ 
ponds  to  "fast  circuit  switching."  Here,  dial-up 
and  routing  are  carried  out  for  each  interactive 
message . 


i 
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Fig.  2  Details  of  Preliminary  Segment 

The  PERFORMANCE  GOAL  BLOCK  store*  the  end-to- 
end  performance  desired  for  the  circuit  switched 
network  being  designed.  This  is  the  average  end- 
to-end  loss  probabilitv,  ,  averaged  over  ail 
node  pairs  and  weighted  bv  traffic.  It  al^o  inc¬ 
ludes  the  average  cross  network  cel  a,-  desired  for 
Tvpe  A  and  Type  B  messages  --  again  averaged  over 
all  node  pairs  but  veighceJ  ti  1 v  bv  data  traf * ,c . 

3efore  the  actual  network  design  pro  dure  can 
begin  seme  processing  of  the  parameters  scored  in 
the  INPUT  BLOCK  must  be  carried  out.  This  is  done 
in  the  CONVERSION  BLOCK.  Spec i : ic a l W ,  the  tnter- 
node  distances  are  computed  using  the  r.od*»  loca¬ 
tions.  The  inter-noJe  distances  are  stored  ip  the 
matrix  £l^ ^  .  The  i-j  entry  oi  this  matrix  is  a 
measure  of  the  cost  of  a  link  direct iv  connecting 
node  "i"  to  node  "j".  The  network  design  proceiim? 
operates  with  a  comhtned  traffic  matrix  of  voice 
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and  data  traffic.  The  cc*mblnt*d  entrv  has  units  of 
Erlang*.  Hence,  the  data  t  rat  tie  Has  t*  be  c>nv<r- 
ted  to  equivalent  Erlang  values  .iml  to  the 

voice  traffic.  This  is  carried  out  it',  this  hlo.k, 
by  computing  the  average  number  of  data  calls  per 
second  and  the  call  holding  tire  f  'r  tin  lssurud 
data  traffic  character  ist  ics  and  handling  ?trarep4v 
The  following  formula  is  used  to  compute  F.D.^,  the 

Erlang  value  of  the  data  traffic  originating  at 
node  "i"  with  node  "JM  as  destination. 

ED  -  ^RHOAHD^A +i(RHOB)(Dtj>[i  +<  1  ‘Pill 
g  L  R  K  LB^) 

With  this,  the  Total  Traffic  Matrix  *lv1jl  + 

(ED^^J  Is  computed. 

Topological  Design  Segment  is  the  heart  of 
the  network  design  algorithm.  Optimization  of  the 
network  is  carried  out  in  this  segment.  Details 
of  the  Topological  Design  Segment  are  illustrated 
in  Fig.  3.  Certain  parameters  must  be  defined  be¬ 
fore  the  optimization  procedure  of  this  segment 
can  commence.  This  definition  takes  place  In  the 
INITIALIZATION  BLOCK.  Specifically,  the  value  of 
a  parameter  "E"  la  set  equal  to  1  PL-  This  cor- 

iTJjT 

responds  to  the  link  blocking  probability  "guess¬ 
timate"  required  to  achieve  the  desired  end-to-end 
average  loss  probability.  PL  was  given  in  the 
PRELIMINARY  SEGMENT.  This  "formula"  for  E  is  de¬ 
rived  from  a  "union"  bound  assigning  that  on  the 
average  there  are  'TnN  links  in  each  end-to-end 
route.  A  parameter  "TEST"  is  set  initially  equal 
to  00  .  This  paramter  is  used  in  the  test  for  con¬ 
vergence  of  the  optimization  procedure.  Finally, 
a  matrix  Cwijl  is  defined.  This  is  a  NS  x  NH  ma¬ 
trix.  It  is  referred  to  as  the  "weight  matrix." 
Initially,  it  is  defined  as  having  all  entries 
equal  to  l. 


Length.  .  *  W  This  is  .in  NN  x  NN  ri.it  t  i  - 

Length  j  j  l .»  a  weighted  distance  between  n  ■  1.  "i" 

and  node  "j".  With  the  matrix  U.ength  J  .1'".  i  ‘  , 

j  ijJ 

the  CONNECTIVITY  BIOCK  considers  each  sure.  sjve 
node  pair!  -7,  I/T  It  uses  a  "shortest  ".n  i,” 
algorithm  to  connect  the  nodes  in  each  pair.  E  ic; 
ordered  node  pair 'X  ,3  is  connected  bv  a  route 
directed  links  fromX-to-J.  However,  lengths  in 
the  shortest  path  algorithm  are  taken  relative  t 
the  weighted  distances  given  bv  matrix  ([Length..,]. 
In  the  first  iteration  the  network  put  out  bv  the 
CONNECTIVITY  BLOCK  will  be  fully  connected. 

With  topological  connectivity  defined  the 
TRUNK  ENGINEERING  BLOCK  is  entered.  Each  link  of 
the  connected  network  Is  considered.  All  of  the 
end-to-end  shortest  oath  routes,  determined  in  the 
CONNECTIVITY  BLOCK,  utilizing  a  given  link  are  en¬ 
umerated.  The  end-to-end  traffic  requirement  tor 
each  route  is  obtained  from  the  matrix  (t  ”1  .  For 

lj 

a  given  link  between  nodes  "1"  and  "J",  the  sum 
end-to-end  traffic,  A^.,  of  all  "weighted"  rfiortest 
path  routes  traverslngJthis  link  is  obtained.  If 
circuit  switched  "calls"  are  routed  on  these  oath-, 
Aji  upperbounds  the  traffic  that  will  be  offered  to 
directed  link  (i.J).  The  TRUNK  ENGINEERING  BLOCK 
solves  the  equation  E  -  E(Sji,  A^.)  for  Sj,  and 
quantizes  it.  E(  ,  )  is  the  blocking  probabil¬ 
ity  given  by  the  Erlang  B  formula.  S. .  provides 
dimension  of  directed  link  (1,J)  in  trunks. 

The  COST  METRIC  BLOCK  computes  a  measure  cf 
the  cost  of  the  connected  and  trunk  dimensioned 
network  which  has  been  produced  by  the  two  previous 

blocks.  COST  METRIC  «Tw,  L.  .S.  ,  .  The  sum  is 
l—  lj  ‘J  lJ 

over  all  ordered  node  pairs  1,J  which  are  adjacent 
in  the  network.  The  value  of  COST  METRIC  is  com¬ 
pared  to  the  value  of  the  parameter  TEST.  If  it  is 
greater  than  or  equal  to  TEST,  the  TOPOLOCICAL  DE¬ 
SIGN  SECMENT  is  exited.  If  COST  METRIC  is  less 
than  TEST  the  optimization  procedure  continues. 
Since  TEST  "00  initially,  the  procedure  continues 
for  at  least  two  Iterations.  When  it  continues, 
TEST  is  set  ■  COST  METRIC  and  n  new  weight  matrix 
O  ']  it  computed  from  the  following  formula, 

J  ?  E  (S,A)  /f-  E  (S.AVj- 

A  /  L  ~b  S  -* 

The  denominator  in  this  formula  is  the  rate  of 
change  of  E(S,A)  over  the  interval  (s,  S+l],  i.e. 
E(S+1,  A)  -  E(S,A).  This  can  be  reduced  to 

WM  *  f  E(S-1,A)  -  E(S,A)  \C  _ 1 _  } 

1  t  E(S,A)  -  E (S+l , A )  M[(A/S!  E(S-l,AlJ+l  > 

The  above  two  formulas  are  evaluated  at  S - S ^  ^  , 
A-Aj,.  The  TOPOLOCICAL  DESICN  SEGMENT  then  goes 
the  next  Iteration.  The  CONNECTIVITY  BLOCK  now 
computes  the  matrix  (Length^"]  using  the  new  we  i  ijt 
matrix.  This  iterative  procedure  continues  until  a 
COST  METRIC?-  TEST  is  obtained.  Ey  studying  th" 
formula  for  Wj  ,  one  may  note  that  low  values  are 
obtained  when  dealing  with  a  link  which  in  response 
to  an  incremental  Increase  in  offered  flow  can  a,h 
(pve  the  deslrqd  blocking  p.robab  i  I  i  tv  ,  E,  with  i- 
few  additional  trunks  is  possible.  The  TOPOLOGIOAI. 
DESICN  SEiIMENT  achieves  a  connectivity  at  the  mlv 
bv  employing  short,  highly  utilized  directed  links 
This  provides  a  cost-efficient  circuit  switched 


Th  1  s  is  an  NN  x  NN 
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Fig.  3  Details  of  Topological  Design  Sejpnent 

The  TOPOLOGICAL  DESIGN  SECMENT  operates  bv 
connecting  nodes  with  "short"  high  usage  routes. 
Wlth*iy  "rational"  cost  measure  this  should  be  ec¬ 
onomic  network.  With  parameters  defined  in  the 
INITIALIZATION  BLOCK,  tie  CONNECTIVITY  BLOCK  Is 
entered.  Here,  nuJes  are  connected  ov  routes  us¬ 
ing  the  following  procedure.  The  CONNECTIVITY 
BLOCK  computes  a  matrix  ^Lengthfj"]  .  The  i-J  com¬ 
ponent  of  this  matrix  Is  given  by  the  forr.iula 
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network.  Note  that  the  use  of  the-  weight  matrix 
and  the  specific  formula  for  tho  weight  are  simi¬ 
lar  to  the  .  iinvt'K  branch  elimination  method  used 
to  design  packet  switched  networks . t lj 

Rout  mit  Segment  In  circuit  switched  networks 
when  a  c.mirmnu  ator  at  "  i "  wishes  to  "talk"  to  one 
at  node  "j",  a  c  -nnea  ion  is  first  attempted  Dv 
reserving  trunks  on  a  path  termed  the  "primary 
route."  If  due  to  blocking  the  connection  cannot 
be  completed  an  attempt  is  made  at  reserving 
trunks  along  an  alternate  route.  The  ROUTING  SEG¬ 
MENT  of  the  algorithm,  shown  in  Fig.  U ,  provides 
both  primary  and  alternate  routing  doctrines’  for 
the  network  derived  in  the  TOPOLOGICAL  DESIGN  SEG¬ 
MENT.  The  primary  route  from  node  "i"  to  node  "J" 
is  taken  as  the  shortest  directed  path  between 
these  two  nodes.  "Shortness"  here  is  taken  to 
mean  length  relative  to  ^Length  "3,  as  computed  in 
the  last  iteration  of  the  TOPOLOCICAL  DESIGN  SEG¬ 
MENT.  The  alternate  routing  doctrine  defined  is 
a  form  cf  "progressive  routing."  A  detailed  des¬ 
cription  otf  it  Is  given  in  Ref.  1.43.  Basically, 
at  each  of  the  Interior  nodes  of  the  primary  route 
connecting  "i"  to  "j",  there  is  a  cable  which 
lists  routes  to  node  "j".  These  routes  diverge 
from  the  primary  route  at  the  interior  node.  The 
routes  in  each  table  are  ordered  in  a  hierarchical 
manner.  In  attempting  to  reserve  the  primary 
route,  if  blocking  is  encountered  at  interior  node 
"k",  the  routing  table  residing  at  this  node  is 
consulted.  An  attempt  at  completing  the  route  Is 
then  made  along  the  first  entry  in  the  table.  If 
there  is  blocking  along  this  route  the  second  is 
tried,  etc.  If  there  is  blocking  along  all  of 
these  routes  the  "caller"  is  turned  away  from  the 
network.  The  primary  route  and  possible  alternate 
routes  In  one  situation  are  Illustrated  In  Fig.  5. 
In  the  version  of  the  algorithm  which  ha*  been  im¬ 
plemented,  diverging  alternate  routes  are  taken  to 
be  the  second  shortest  route,  the  third  shortest 
route,  etc.  Shortness  is  taken  according  to  the 
last  values  of  fLengthj^. 

ROUTING  SEGMENT 

f - 1 

I 
I 
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I 

I 
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FIC.  4  DETAILS  OF  ROUTING  SEGMENT 

Network  Specification  In  this  module  of  the 
algorithm  the  network  design  is  stored;  that  is, 
the  topology,  link  trunk  dimensions  and  routing 
doctrine  are  sunsnarized. 

Voice  Performance  Analysts  Segment  and  Ins¬ 
tructed  Adjustments  The  goal  of  the  algorithm  is 


Fig  5  The  Primary  Route  From  Node  "i"  to 
Node  "J"  with  Diverging  Alternate 
Routes 

to  design  a  cost-efficient  network  satisfying  a 
constraint  on  P  .  The  TOPOLOGICAL  DESIGN  SEGMENT 
attempts  to  meet  this  constraint  bv  sizing  the 
trunk  dimensions  of  each  link  to  correspond  to  the 
trunk  engineering  by  utilizing  the  Erlang  P,  formu¬ 
la.  However,  the  data  traffic  and  alternate  rout¬ 
ed  traffic  may  not  have  the  Poisson  arrival  statis¬ 
tics  assumed  by  this  formula.  As  a  result,  the 
network  initially  stored  in  the  NETWORK  SPECIFICA¬ 
TION  BLOCK  may  not  meet  the  desired  P[_.  The  VOICE 
PERFORMANCE  ANALYSIS  SECMENT  and  INSTRUCTED  ADJUST¬ 
MENT  BLOCK  correct  this  deficiency.  The  VOICE  PER¬ 
FORMANCE  ANALYSIS  SEGMENT  analyzes  the  network 
stored  in  the  NETWORK  SPECIFICATION  BLOCK  and  ob¬ 
tains  an  estimate,  called  PL,  of  P| .  Details  of 
the  analysis  procedure  are  given  in  Ref.  (V}.  The 
estimate  P^  is  compared  to  the  desired  value  of  Pj 
If  It  is  within  A  ,  (typically  «10*^),  the  net¬ 
work  stored  In  the  NETWORK  SPECIFICATION  BLOCK  is 
accepted  as  the  design  and  the  algorithm  proceeds 
to  the  DATA  PERFORMANCE  ANALYSIS  SEGMENT.  If  PL 

is  not  within  A  the  algorithm  proceeds  to  the  IN¬ 
STRUCTED  ADJUSTMENTS  BLOCK,  which  computes  a  new 
value  of  link  blocking  probability  form  the  previ¬ 
ous  one  for  trunk  engineering.  The  new  value, 
E(new),  is  obtained  from  t]ie  following  formula. 
E(new)  -  E  +(1//8n')  (Pl  -  PL)  .  The  INSTRUCTED  AD¬ 
JUSTMENTS  BLOCK  then  takes  the  network  topologv 
stored  in  the  NETWORK  SPECIFICATION  BLOCK.  It  car¬ 
ries  out  the  trunk  engineering  of  each  link  tc  sat¬ 
isfy  the  blocking  probability,  E(ne”>.  A  new  net¬ 
work  is  then  produced;  the  routing  doctrines  stay 
the  same.  This  new  network  replaces  the  previous 
one  stored  in  the  NETWORK  SPECIFICATION  BLOCK.  Its 
loss  probability  Is  analyzed  In  the  VOICE  PERFOR¬ 
MANCE  ANALYSIS  SEGMENT.  The  procedure  continues  as 
described  above  until  the  trunk  engineering  produ¬ 
ces  a  ^  within  A  of  the  desired  Pj  . 

Data  Performance  Analysis  Segment  This  block 
computes  D^  and  Dg,  the  average  cross-network  de¬ 
lay  for  Type  A  ar.d  Type  B  data  messages,  respe:t- 
ively.  The  average  is  taken,  among  other  things, 
with  respect  to  all  node  pair  03ta  traffic  require¬ 
ments.  These  quantities  arc  computed  by  obtaining 
D^j  for  each  traffic  tvpe.  This  is  the  average 
cross-network  delay  for  a  data  message  (either 
Type  A  or  B)  which  has  its  source  at  node  "i"  and 
destination  at  node  "J",  in  computed  by  the 

algorithm  using,  D1J  CL  +  T  +  cr’  CL  is 
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the  average  time  taken  In  waiting  to  rodlal  until 
a  route  is  finally  granted.  It  has  been  assumed 
that  if  a  data  communicator  is  turned  away  he  re¬ 
dials  after  10  seconds.  Hence,  C^-10  Pj_/ ( 1  - ^  - 
Cp  Is  the  signal  propagation  time,  which  is  deter¬ 
mined  by  the  actual  length  of  the  l-to-)  primary 
route.  Host  traffic  will  go  along  this  route.  C( 
is  the  transmission  time  for  a  message,  given  by 
the  formula  ^"message  iength/R.  Message  length 
is  La  or  L_  as  specified  in  Che  PRELIMINARY  SEG¬ 
MENT.  Cr  is  the  average  time  taken  to  set-up  the 
route  connecting  node  "l"  to  "j"  or  to  turn  the 
communicator  away  because  of  inability  Co  reserve 
a  route.  Let  this  actual  route  set-up  time  be  de¬ 
signated  as  Tr.  For  the  Type  A  file  transfer  mes¬ 
sage,  or  for  a  Type  B  interactive  traffic  with 
classical  circuit  switching,  the  entire  set-up  de¬ 
lay  is  borne  by  the  first  message.  The  remaining 
K-l  messages  have  zero  set-up  time.  In  this  case 
the  average  Cr  -  Tr/K.  The  value  of  Tr(or  Cf)  de¬ 
pends  upon  the  type  of  signalling  employed  by  the 
circuit  switched  network  in  order  to  set-up  a 
route  — 'the  methods  used  to  transmit  routing  in¬ 
formation  through  the  network  in  reserving  trunks. 
The  design  algorithm  assumes  that  Common  Channel 
Interswitch  Signalling  is  employed.  Separate  di¬ 
gital  links  are  assumed  to  connect  the  nodes  of 
the  network  and  are  reserved  strictly  for  signal¬ 
ling  information.  Routes  are  reserved  by  sending 
small  addressing  packets  on  these  signalling  ljzfcs . 
Transmission  occurs  at  the  rate  of  R,  bits  per  se¬ 
conds.  Typically,  R*  "  9600.  A  circuit  switch 
model  la  incorporated  into  this  block  of  the  algo¬ 
rithm  in  order  to  account  for  delays  in  the  switch. 
These  delays  correspond  to  queuing  delays  in  hav¬ 
ing  the  switch  deal  with  the  routing  requests  and 
finding  a  path  through  the  switch. 

Costing  Segment  In  this  block  the  transmis¬ 
sion  coat  and  the  switch  cost  of  the  circuit  swit¬ 
ched  network  design  are  computed.  Transmission 
cost  of  the  network  is  taken  to  be  the  sun  of  the 
separate  transmission  costs  of  each  of  the  links 
in  the  network.  Switch  cost  is  taken  to  be  the 
sun  of  the  costs  of  the  individual  switches.  Be¬ 
cause  of  the  modularity  of  this  block  it  can  be 
programmed  to  model  any  tariff  and  circuit  switch 
cost.  For  example,  in  our  experiments  we  have  us¬ 
ed  the  DOS  line  rates. 

Outputs  In  this  block  the  final  circuit 
switch  network  design  with  performance  and  cost  is 
stored . 

EXAMPLE  NETWORK  DESIGNS 

The  topologies  of  typical  network  designs  ob¬ 
tained  with  the  algorithm  arc  shown  in  Figs.  6a, 
b,  &  c.  These  are  eight  node  networks  spanning 
the  continental  United  States.  They  were  designed 
for  total  voice  traffic  »  5400  Erlangs,  total  data 
traffic  -  361  kbps,  RHOA  -  RHOB  -  0.5,  K  -  200 
(classical  circuit  switching),  LB  •  1000,  T  •  10. 
The  dealgns  were  carried  out  for  different  values 
of  PL.  Costs  are  noted.  These  designs  were  ob¬ 
tained  by  implementing  the  algorithm  in  Fortran 
The  running  time  was  approximately  five  seconds  on 


a  DEC  20/50  with  the  TOPS  20  operating  system. 
The  memory  required  was  close  to  hS.GOO  <  :<>  bit) 

words . 


Fig.  6a  Pl  *  .002,  Cost  *  $2,476,025  per  month 


Fig.  6c  Pt  *  .1,  Cost  ■  $1,832,319  per  month 
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ABSTRACT  The  problem  of  determining  the  optimal  number  of  WATS  lines 
given  an  hourly  profile  of  outbound  calls  from  a  given  location  is  considered.  It  is 
shown  that  the  relationship  between  cost  and  earned  load  under  the  new  WATS 
tariff  with  a  very  small  fixed  charge  for  lines  is  relatively  smooth  and  unimodal  It  is 
thus  possible  to  obtain  globally  optimal  solutions  to  most  problems  by  using  a  very 
efficient  local  optimization  procedure  which  only  considers  addition  and  deletion  of 
single  lines  and  the  exchange  of  one  line  for  another  Computational  experience 
with  this  procedure  and  a  comparison  with  exhaustive  search  are  presented. 


1.  Introduction 

Wide  Area  Telephone  Service  (WATS)  is  a 
service  offered  by  telephone  companies  to 
provide  volume  discounts  to  larger  users  Un¬ 
til  recently,  WATS  offerings  included  both  full 
and  measured  service  With  full  service 
WATS,  the  user  was  permitted  essentially  un¬ 
limited  calling  for  a  fixed  monthly  fee.  Meas¬ 
ured  service  provided  for  a  lower  fixed  fee.  10 
hours  of  free  service  and  an  hourly  fee  for 
usage  in  excess  of  10  hours  per  month  The 
new  WATS  tariff  eliminates  full  period  WATS 
service  and  offers  a  somewhat  modified  form 
of  measured  service 

There  are  actually  many  WATS  offerings 
for  both  inward  and  outward  callers  to  and 
from  a  given  location  on  either  an  intrastate  or 
interstate  basis  We  will  focus  here  on  the  in¬ 
terstate  out-WATS  service  offered  by  AT&T 

"Thu  work  was  supported  in  part  by  a  grant  from  Con 
Tel  Information  Systems 


Long  Lines  within  the  continental  U  S.  In  this 
case  the  continental  U.S.  is  divided  up  into  5 
bands.  Band  1  consists  of  area  codes  close  to 
the  caller  but  outside  his  state.  Band  2  con¬ 
sists  of  these  area  codes  plus  others  some¬ 
what  farther  away.  This  continues  until  Band 
6,  which  includes  all  area  codes  in  the  US. 
including  Alaska,  Hawaii,  Puerto  Rico  and 
the  Virgin  Islands.  In  this  paper,  we  will  deal 
primarily  with  the  5  WATS  bands  which  cover 
the  continental  U  S  outside  the  caller's  state 
Thus,  each  WATS  band  includes  all  the  lower 
numbered  bands  and  represents  a  roughly  fixed 
percentage  of  the  telephones  in  the  U  S  re¬ 
gardless  of  where  the  caller  is  The  specific 
area  codes  involved  are  a  function  of  which 
state  the  caller  is  in.  The  service  applies  only 
to  interstate  calls  Most  states,  if  not  all.  have 
intrastate  WATS  setvice  but  may  have  dif 
ferent  characteristics  and  are  not  the  subject 
of  this  paper  The  service  applies  only  to  out 
ward  calls  from  a  single  location  The  in 
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terested  reader  is  referred  to  [1]  for  all  the 
details  of  this  tariff  Its  saliant  numerical  pa¬ 
rameters  are  described  in  the  following  sec 
tions 

In  order  to  use  WATS  service,  the  user  or¬ 
ders  one  or  more  lines,  which  we  refer  to  as  a 
group,  in  one  or  more  of  the  6  bands  A  line 
in  a  given  band  may  be  used  to  place  an  inter¬ 
state  call  to  any  area  code  in  that  band  Note 
that  if  the  user  has  both  Band  1  and  Band  2 
groups,  for  example,  a  call  to  an  area  code  in 
Band  1  may  use  a  line  in  either  group.  In  addi¬ 
tion  to  ordering  lines,  users  must  decide  on 
how  to  route  calls  That  is.  they  must  decide 
whether  to  allow  an  outgoing  call  to  use  a 
higher  numbered  band  if  no  lines  in  its  band 
are  available,  or  overflow  the  call  to  DDD 
(Direct  Distance  Dialing;  i.e.,  the  call  is  placed 
as  an  ordinary  long  distance  call) .  or  block  the 
call  entirely. 

2.  The  New  WATS  Tariff 

The  new  tariff,  as  it  is  used  in  this  paper,  is 
described  in  detail  on  the  8th  revised  page 
14.1  of  AT&T  Tariff  259  (1).  The  significant 
change  from  the  old  tariff  is  that  full  period 
lines  are  eliminated  and  only  measured  lines 
remain  Specifically.  the  cost  of  a  WATS  line 
carrying  Hd  hours  of  traffic  in  the  daytime,  Hf 
hours  of  traffic  in  the  evening,  and  H„  hours 
of  traffic  at  night  costs: 

C  -  27.50  +  Cd(Hd)  +  C,(H,)  +  C„(H„) 

where 

Cd(Hd)  is  the  cost  for  Hd  hours  of  daytime 
traffic,  computed  from  a  piecewise  linear  func¬ 
tion.  i.e.  so  much  per  hour  for  the  first  15 
hours,  so  much  per  hour  for  the  next  25 
hours,  etc  For  example,  for  a  Band  5  line  in 
New  York  City, 

Cd  -  19  91  x  (min(Hd,  15)) 

+  17  71  x  (min  (Hd  -  15,25)1* 
+  15.53  x  (min  (Hd  -  40.  40)1* 
13.13  x  |Hd  -  80)* 
where  (X)  *  is  X  if  X  >  0  and  0  otherwise 


C,  is  computed  in  a  similar  manner  C„  is 
simply  O  L)2  x  H,  The  constants  used  to 
compute  C,  are  exactly  f>5%  of  those  to  com 
pute  Cd  This,  presumably,  reflects  the  fact 
that  DDD  costs  in  the  evening  are  65%  of  the 
day  rates  The  night  time  hourly  cost  is.  ap 
proximately  40%  of  an  average  day  rate, 
again  paralleling  DDD  costs  Table  1  gives  the 
WATS  and  DDD  costs  (estimates)  for  New 
York  City  These  are  used  in  the  computa¬ 
tional  experiments  and  examples  presented  in 
the  rest  of  this  paper. 

Table  1A.  WATS  Line  Daytime  Cost  (Hourly) 


Band 

1st  15 
Hours 

Next  15 
Hours 

Next  40 
Hours 

All  Other 
Hours 

1 

15.13 

13.46 

11  79 

9.98 

2 

17.32 

1542 

13.52 

11.43 

3 

18.14 

16.15 

14.15 

11.97 

4 

18.89 

16.81 

14.74 

12.47 

5 

19.91 

17.71 

15.53 

13.13 

Table  IB.  WATS  Line  Evening  Cost  (Hourly) 


Band 

1st  15 
Hours 

Next  15 
Hours 

Next  40 
Hours 

All  Other 
Hours 

1 

9.83 

8.75 

7.66 

6.49 

2 

11.26 

10.02 

8.79 

7  43 

3 

11.79 

1050 

9.20 

778 

4 

12  29 

10.93 

9.58 

8.11 

5 

12.94 

11.51 

10.09 

8.53 

Table  1C.  WATS  Line  Nighttime  Cost  (Hourly) 


Band 

All  Hours 

1 

5.26 

2 

602 

3 

6.31 

4 

6.56 

5 

692 

Table  ID.  Daytime  Average  Cost 
From  New  York  City 
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Thus,  we  see  ih.it  WATS  and  I  >DI) 
vary  in  similar  ways  It  will  be  useful  for  is  to 
consider  a  model  of  how  cost  varies  with  hand 
as  it  will  allow  us  to  make  a  quantitat.ve  trade 
off  between  traffic  in  different  hands  and  in 
particular  to  approach  the  problem  of  decid- 
inq  on  the  size  of  each  WATS  group  Table  2 
gives  the  relative  costs  in  each  of  the  bands  as 
seen  from  New  York  City  These  factors  ap 
ply  to  all  hourly  costs;  eg.  DDD  costs,  the 
first  15  hours  of  daytime  WATS  traffic,  etc 
The  factors  are  exact  for  WATS  costs  and  rea¬ 
sonable  approximations  for  DDD  costs  These 
factors  are  different  for  different  states,  but 
they  exist  for  all  states  and  are  easily  deter¬ 
mined  from  the  tariff. 

Indeed,  the  entire  WATS  tariff  may  be 
modelled  as  a  bulk  discount  on  DDD.  Given  a 
vector,  D{,,  the  average  per  hour  DDD  cost  in 
Band  b,  the  constants  used  to  compute  Cd. 
C„  and  C„  can  be  thought  of  as  percentages 
of  Db.  For  example,  if  the  average  hourly  day¬ 
time  DDD  cost  for  Band  5  calls  from  NY  City 
were  $26.26  during  the  day,  we  could  write 
Cd  as 

Cd  -  26.26  x  {.758  x  min  (Hd,  15) 

+  .674  x  [min(Hd  -  15,  25)]  - 
+  .591  x  (min(Hd  -  40  40)] - 
+  .5  x  (Hd  -  80]}. 

That  is,  the  WATS  tariff  offers  a  discount  of 
between  25%  and  50%  relative  to  DDD  The 
discount  is.  of  course,  relative  to  the  26  26 
figure.  If  this  figure  were  lower,  the  discount 
is,  relatively,  smaller.  Based  upon  an  assumed 
4  minute  holding  time,  this  rate  appears  to  be 
a  reasonable  estimate.  Shorter  holding  times 
would  give,  comparatively,  higher  discounts 


Tabla  2.  Relative  Costs 


Band 

Factor 

1 

1.000 

2 

1.145 

3 

1.199 

4 

1.249 

5 

1.316 

Tln>  significant  thing  :s  that  the  same  model 
with  the  same  discounts  applies  to  all  bands, 
all  states,  and  all  times  of  day 

The  usage  of  a  WATS  line  in  a  band  is  de¬ 
fined  to  be  the  average  usage  over  the  group 
Thus,  if  there  are  M  lines  in  a  Band  b  WATS 
group  and  together  they  carry  H  hours  of  day 
time  traffic  during  a  month,  then  Hd  =  H/M 
A  larger  group  will  block  less  traffic  and. 
hence  carry  more  traffic  Thus,  if  we  add  a 
line  to  the  group  both  H  and  M  will  increase. 
Since  the  efficiency  of  lines  within  a  group 
decreases  as  the  number  of  lines  increases, 
however.  H  will  increase  more  slowly  than  M 
and  Hd  will  decrease  The  cost/hour  for  traffic 
carried  by  the  group  will  therefore  increase  as 
the  size  of  the  group  increases.  It  will  not. 
however,  exceed  the  DDD  cost  until  the 
$27  50  fixed  charge  becomes  a  significant 
portion  of  the  total  cost  of  the  line. 

The  choice  of  how  to  optimally  route  calls  is 
itself  an  interesting  problem  involving  trade¬ 
offs  between  cost  and  quality  of  service  In 
practice,  a  user's  routing  choices  are  limited 
by  the  capabilities  of  his  local  equipment 
(PABX)  and  may  include  overflow,  queueing 
for  lines  in  some  bands,  blocking,  and  even 
dme-of-day  sensitive  routing.  This  is.  however, 
beyond  the  scope  of  this  paper.  The  simple 
assumption  is  made  that  a  call  is  offered  to  the 
lowest  numbered  band  capable  of  handling  it 
and  if  no  lines  are  available  in  this  band  then 
the  call  overflows  to  each  higher  numbered 
band  and  finally  to  DDD.  if  necessary. 

Thus,  the  focus  is  on  the  problem  of  deter¬ 
mining  the  optimal  number  of  lines  in  each 
WATS  band  group  The  larger  the  number  of 
lines  in  a  group,  the  more  trafiic  the  group 
carries  and  hence,  the  less  traffic  overflows  to 
higher  numbered  groups  and  to  DDD  The 
base  cost  of  a  line  is  very  small,  only  $27  50 
per  month  At  first  glance  one  might  expect 
this  would  lead  to  solutions  with  manv  lines 
However,  as  we  will  see  in  the  following  sec¬ 
tion,  the  tariff  provides  that  the  cost  per  hour 
decreases  as  the  number  of  hours  of  usage  in¬ 
creases  and  that  the  number  of  hours  of  usage 
is  averaged  over  the  lines  in  a  band  Thus,  add¬ 
ing  a  line  to  a  group  in  a  band  will  decrease 
the  number  of  hours  of  usage  per  line  and 
hence,  the  average  cost  per  hour  of  carried 
calls  increases  This  limits  the  number  of  lines 
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desired  in  a  band  dnd  makes  the  problem  in 
teresting 

3.  Optimization  Procedure 

The  optimization  procedure  is  imple¬ 
mented  in  FORTRAN  and  runs  on  a  DEC 
20  60  The  software  package  includes  an  in 
teractive  front-end  which  permits  the  user  to 
edit  a  WATS  configuration,  traffic,  tariff,  and 
parameter  values  Finally,  it  includes  a  full 
spectrum  of  design  and  analysis  modules 
which  allow  him  to  design  configurations  us¬ 
ing  a  variety  of  options  for  queueing,  blocking 
and  overflowing  traffic  For  the  sake  of  clarity 
of  presentation,  we  will  concentrate  on  the 
overflow  version  of  the  problem  in  the  re¬ 
mainder  of  this  paper.  Thus,  we  will  assume 
that  traffic  offered  to  a  group  overflows  to 
each  of  the  higher  numbered  groups  and 
eventually  to  DDD,  if  necessary 

The  procedure  takes  as  input  a  description 
of  the  tariff  as  given  above  and  a  description 
of  the  traffic  to  be  carried  by  the  network.  This 
traffic  description  may  take  one  of  several 
forms  depending  on  what  information  is  avail¬ 
able.  Ideally,  the  user  will  process  billing  infor¬ 
mation  supplied  by  the  local  telephone  com¬ 
pany  and  get  at  least  one  month’s  data  giving 
the  actual  number  of  minutes,  calls  and  the 
actuai  DDD  (Direct  Distance  Dial)  cost  over 
the  given  time  period  Alternatively  estimat¬ 
ing  procedures  may  be  used  if  this  informa¬ 
tion  is  not  available  In  either  case,  the  user 
develops  a  traffic  matrix  which  gives  the  num¬ 
ber  of  erlangs  of  traffic"  to  each  band  DDD 
cost  ner  minute  in  each  band,  which  is  also  an 
input  to  the  program,  may  also  be  developed 
as  an  average  derived  from  actual  call  records 
or  alternatively  may  be  derived  from  an  aver¬ 
age  figure  for  the  band.  Usually,  this  will  not 
be  a  critical  issue  as  DDD  costs  do  not  differ 
widely  from  one  part  of  a  band  to  another 

If  however,  a  large  amount  of  traffic  is 
known  to  go  to  a  location  near  the  edge  of  the 
band  then  the  DDD  cost  of  this  traffic  will  not 
be  typical  for  the  band  and  should  be  com 
puted  using  the  actual  cost  of  calling  that  loca 
tion  Also,  if  the  length  of  all  calls,  or  calls  to 
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certain  b,.nds  is  known  to  be  unusually  short 
then  the  one  minute  minimum  charge  which 
is  applicable  lor  most  long  distance  calls  will 
significantly  increase  the  DDD  cost  In  this 
case,  one  should  obtain  the  actual  number  of 
calls  so  that  a  more  accurate  estimate  of  the 
true  DDD  cost  can  be  computed 

The  major  outputs  from  the  procedure  are 
the  number  of  lines  in  each  group  and  the 
total  WATS  and  DDD  cost  Auxiliary  reports 
are  also  produced,  including  the  cost  per 
band,  the  traffic  offered  to  and  carried  by 
each  band,  details  on  an  hour  by  hour  basis 
and  summaries  of  activity  at  major  locations 
(when  input  supporting  this  is  available)  This 
is  useful  as  part  of  more  global  optimization 
procedures  which  determine  the  placement  of 
tie  lines,  FX  (foreign  exchange)  lines,  and 
tandem  switches. 

Figure  1  is  a  flowchart  of  the  global  optimi¬ 
zation  procedure  It  begins  by  initializing  the 
configuration  to  an  all  band  5  solution,  i.e., 
no  lines  in  bands  1  through  4  and  a  sufficient 
number  of  lines  in  band  6  to  satisfy  a  given 
grade  of  service  during  the  peak  hour."  As 
will  be  seen,  the  procedure  is  not  sensitive  to 
the  starting  configuration  used  In  particular, 
we  have  not  observed  any  significant  varia¬ 
tion  in  the  running  time  of  the  procedure  or 
the  quality  of  the  solution  it  produces  when 
the  starting  configuration  is  changed 

The  procedure  is  constrained  to  produce 
solutions  with  no  fewer  than  LO,  lines  and  no 
more  than  HI,  lines  in  band  /  The  values  of 
LO,  and  HI,,  which  are  inputs  to  the  pro¬ 
cedure  can  be  set  in  several  ways  First,  if  one 
wishes  simply  to  find  an  optimal  solution,  LO, 
can  be  set  to  0  and  HI,  can  be  set  to  a  large 
number  (e  g.  a  number  of  lines  sufficient  to 
carry  99%  of  the  load  offered  to  band  ;  and  all 
lower  bands)  If  the  expansion  of  an  already 
existing  network  or  a  local  modification  of  a 
larger  network  (containing  private  tandem 
switches.  FX  lines,  etc  )  is  being  considered. 
LO,  and  HI,  may  be  so;  to  values  very  close  to 
current  values  Finally  by  setting  LO,  =  HI, 
=  V,  a  specific  configuration  containing  V, 
lines  per  band  can  be  evaluated 


*(ira»fo  "1  ***rvir«  tv  .iv  thr  fraction  of  calk 

X  li  :ni.f  I  ||>  lit'  (lie  ‘ysh'm 


1 


«• 


* 


-• 


Fig.  1.  OPTSUB 


166 


Journal  of  1  ^‘ectomrrwjnicaticpo  Networks 


The  current  solution  being  examined  is 
maintained  in  the  array  VAL  Thus.  VAL, 
contains  the  numbei  of  lines  in  band  i  in  the 
current  solution.  Local  perturbations  to  the 
cunent  solution  are  explored  by  incrementing 
and  decrementing  individual  VAL,. 

The  procedure  EVAL  takes  VAL,  the  traf¬ 
fic  requirements,  and  the  tariff  description  as 
input.  It  then  calculates  the  total  amount  of 
traffic  carried  by  each  WATS  band  (and  by 
DDD)  and  costs  out  the  configuration.  In  the 
case  considered  here,  this  is  done  by  using 
the  Erlang-B  formula  and  allowing  traffic  to 
overflow  freely  from  each  band  through  the 
higher  numbered  bands  and  eventually  to 
DDD,  if  necessary.  In  other  versions  of  the 
procedure,  overflow  and  blocking  are  com¬ 
puted  on  the  basis  of  Rapp's  approximation  to 
Wilkinson's  Equivalent  Random  Method 
14,5]  and  Baner's  equations  [6]  for  estimating 
overflow  in  networks  with  queueing.  EVAL 
also  compares  the  total  cost  of  the  current 
solution  with  MINSUM,  the  lowest  cost  for 
any  configuration  thus  far  evaluated.  If  the 
current  cost  is  less  than  MINSUM,  it  replaces 
MINSUM  and  the  current  configuration  is 
retained. 

The  routines  ADD  and  ROLL  explore  local 
modifications  to  the  current  configuration  and 
comprise  the  heart  of  the  optimization  proce¬ 
dure.  ADD  considers  the  addition  of  one  or 
more  lines  to  band  BAND.  It  proceeds  by  add¬ 
ing  single  lines  to  the  group  in  band  BAND 
until  it  finds  the  cost  of  the  configuration  no 
longer  decreases. 

ROLL,  whose  flowchart  is  shown  in  Figure 
2.  proceeds  along  simitar  lines  attempting  to 
exchange  lines  between  two  bands,  thus  effec¬ 
tively  rolling  traffic  from  one  band  to  another. 
ROLL  begins  by  removing  single  lines  from 
the  group  in  band  FROM  until  the  cost  in¬ 
creases.  It  then  attempts  to  reduce  the  cost  by 
adding  a  line  to  the  group  in  band  TO.  Thus, 
ROLL  considers  both  the  possibility  of  remov¬ 
ing  one  or  more  lines  from  band  FROM  and 
exchanging  one  or  more  lines  in  band  FROM 
for  the  same  number  of  lines  in  band  TO. 
Finally.  ROLL  considers  the  possibility  of  ex¬ 
changing  one  line  in  band  FROM  for  two  lines 
in  band  TO 

It  should  be  r.oiea  that  the  function  of  ADI > 
is  actually  contained  ir.  ROLL  ADD  is  used, 
however,  because  '•  is  roughly  three  tirr.es  as 


fast  as  ROLL  ADD  and  ROLL  are  called  re¬ 
peatedly  to  improve  the  current  solution  via 
local  modifications  until  no  further  progress  is 
made.  The  best  value  prior  to  the  current  cy¬ 
cle  of  local  modification  is  recorded  in  OLD.  If 
after  attempting  all  local  modifications.  MIN¬ 
SUM  is  still  unchanged,  the  procedure  termi¬ 
nates.  A  similar  technique  is  used  to  terminate 
procedure  ROLL. 

An  earlier  version  of  this  algorithm  only 
considered  ROLLing  between  adjacent  bands. 
The  procedure  was  not  as  much  faster  as  one 
might  have  expected  (typically,  the  difference 
was  less  than  a  factor  of  two)  and  the  results 
obtained  were  sometimes  slightly  worse. 

Note  in  Figure  1  that  band  5  Is  treated 
somewhat  differently  from  the  other  bands. 
Variable  BAND  only  takes  values  between  1 
and  4  and  ADD  and  ROLL  are  called  accord¬ 
ingly.  For  each  value  of  BAND,  the  proper 
size  of  the  band  5  group  is  first  determined 
before  other  local  modifications  are  attempted. 
This  is  because  the  tradeoffs  between  other 
bands  are  generally  less  dramatic  than  trade¬ 
offs  involving  band  5.  With  other  bands  the 
WATS  costs,  which  are  generally  similar,  are 
involved.  When  band  5  Is  involved  part  of  the 
tradeoff  is  with  DDD,  which  Is  somewhat 
more  significant. 

In  both  the  flowcharts  in  Figures  1  and  2,  It 
can  be  seen  that  the  algorithm  proceeds  in  an 
orderly  fashion  to  consider  all  reasonable 
local  exchanges  (i.e.,  the  addition  of  a  line  in 
a  band,  the  deletion  of  a  line  from  a  band, 
and  the  exchange  of  a  line  in  one  band  for  a 
line  in  another) .  The  variable  FIVE  in  Figure  1 
is  simply  the  constant  five.  The  variable  BAND 
takes  the  values  one  through  four  and  is  the 
current  band  being  considered  in  a  local  ex¬ 
change.  LOB  AND  takes  values  between  one 
and  BAND-1  and  is  the  other  band  involved 
in  the  current  local  exchange.  At  each  stage. 
EVAL  is  called  to  evaluate  the  current  config¬ 
uration  and  update  MINSUM  if  this  configura¬ 
tion  is  the  best  yet  encountered. 

The  significant  feature  about  this  approach 
is  that  only  local  changes,  i.e.,  the  adding  or 
deleting  of  single  lines,  are  considered  and 
movement  in  a  given  direction  is  halted  when 
no  further  progress  is  encountered  This 
greatly  improves  its  running  time  as  com¬ 
pared  with  a  procedure  considering  a  com¬ 
plete  spectrum  of  alternatives,  which  would 
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involve  considering  a  number  of  alternatives 
equal  to  the  product  over  i  of  (HI,  -  LO,) 

The  running  tune  of  this  procedure  is  essen¬ 
tially  linear  in  the  number  of  perturbations  (at¬ 
tempts  to  move  a  line  from  one  band  to 
another)  examined.  The  time  to  evaluate  a 
perturbation  is  essentially  linear  in  the  number 
of  lines  in  the  network.  The  number  of  pertur¬ 
bations  varies  from  one  network  to  another, 
but  appears  roughly  linear  in  the  number  of 
lines.  The  procedure  ran  for  less  than  one 
minute  for  a  network  with  a  total  of  75  lines  in 
it. 

Theoretically,  in  order  to  guarantee  that 
such  a  procedure  will  converge  to  a  global  op¬ 
timum,  the  function  we  seek  to  minimize 
should  be  unimodal,  i.e..  it  should  possess  a 
unique  minimum  which  is  reached  by  descend¬ 
ing  along  any  sequence  of  local  exchanges.  In 
an  absolute  sense,  the  function  we  seek  to 
minimize  is  not  unimodal.  However,  we  have 
found  it  to  be  nearly  so  in  practice. 

An  argument  can  be  made  for  the  validity 
of  the  procedure  based  upon  the  average  cost 
per  hour  in  each  group.  The  total  offered  load 
is  fixed.  In  all  cases— queueing,  blocking,  and 
overflow— the  total  carried  load  is  also  essen¬ 
tially  fixed.  Thus,  to  minimize  total  cost,  one 
seeks  to  minimize  the  cost  per  hour  of  carried 
traffic.  The  procedure  described  above  can  be 
thought  of  as  one  which  constantly  seeks  to 
move  traffic  from  a  group  with  a  high  cost  per 
hour  to  one  with  a  low  cost  per  hour  and 
stops  when  all  alternatives  have  roughly  the 
same  incremental  cost  per  hour. 

Such  a  procedure  would  work  perfectly, 
yielding  an  optimal  solution  every  time,  if  as 
traffic  was  moved  from  one  group  to  another 
the  cost  per  hour  changed  in  a  consistent 
direction  Unfortunately,  in  this  case,  it  need 
not  As  traffic  is  moved  to  a  group,  the  group 
operates  more  efficiently  and  the  cost  prr 
hour  for  traffic  carried  jy  the  group  goes 
down  Th,s  is  good  as  it  reinforces  oui  deo 
sion  to  move  traffic  into  the  group  Unlortu 
nately.  as  the  traffic  offered  to  a  group  in 
creases,  so  does  the  amount  that  overflows 
from  it  If  the  place  it  ovei flows  to  has  a  higher 
cost  per  hour,  then  it  makes  the  decision  to 
otlet  traffic  Its-  desirable  l  urthermoie.  tin- 
group  from  vOsi:  tieff-C  is  removed  vc ill  Iv 
come  less  c(f  .  lent  imtallv  but  it  one  then  de 
creases  the  >  <>  of  the  gioup.  it  will  become 


more  efficient  Thus,  there  are  many  factors 
working  against  one  another  and  the  net 
result  is  not.  in  general,  clear  All  this  is  com¬ 
pounded  by  the  fact  that  we  are  working  with 
different  amounts  of  traffic  In  different  hours 
and  so  the  effect  of  a  change  is  further  confused 

If  we  examined  the  situation  more  closely, 
however,  we  will  see  that  there  is  a  cause  for 
optimism  The  actual  procedure  does  not  di¬ 
rectly  offer  more  or  less  traffic  to  the  group.  It 
either  increases  or  decreases  the  size  of  the 
group.  It  does  not  calculate  costs  pier  hour;  it 
calculates  the  actual  cost  of  the  network. 
Also,  it  makes  the  smallest  possible  changes, 
working  with  increments  and  decrements  of  a 
single  line.  Thus,  it  makes  decisions  based 
upon  the  total  net  effect  of  a  change.  U  adding 
a  line  to  a  group  decreases  the  total  cost,  it 
will  keep  adding  lines  to  the  group  until  the 
net  effect  reverses.  The  changes  are  made 
gradually  so  that  the  net  effect  can  be  watched 
as  closely  as  possible.  The  entire  process  can 
be  viewed  as  one  of  equalizing  potentials  in 
an  electrial  network  or  as  rolling  water  among 
partitions  within  a  container  The  problem  is 
that  whenever  you  roll  some  water,  some¬ 
body  comes  along  and  alters  the  height  of  the 
partitions. 

if  the  costs  per  hour  in  two  adjacent  groups 
are  nearly  equal,  there  is  little  to  be  gained 
from  rolling  traffic  between  them.  Once  we 
get  to  this  point,  we  can  only  make  small  mis¬ 
takes.  It  is  relatively  easy  to  get  to  this  point 
with  the  WATS  groups  since  their  costs  per 
hour  were  close  to  begin  with.  The  situation 
with  respiect  to  DDD  is  different.  The  whole 
point  of  using  WATS  is  that  it  is  significantly 
cheaper  than  DDD  The  procedure  takes  this 
into  account  by  only  rolling  traffic  between 
groups  which  do  not  overflow  directly  to  DDD 
and  reoptimizing  the  last  group,  which  does, 
directly  Thus,  lor  example,  if  we  are  consid¬ 
ering  the  eflect  of  removing  a  line  from  the 
band  2  group  we  will  also  reoptimize  the 
band  5  group  in  the  case  where  band  2  traffic 
overflows  to  all  higher  numbered  groups.  Re¬ 
call  that  this  ''optimization"  generally  amounts 
to  little  more  than  >  becking  if  it  pays  to  add  a 
line  to  the  band  5  group 

Them  is  only  one  icmatning  cause  for  con- 
iitii  Suppose  we  had  a  problem  with  identi 
cal  trail;,-  in  each  hand  and  we  began  with  a 
,•  .luliun  with  equal  s'/eb  groups  in  each  band 
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It  might  turn  out  that  it  doesn't  pay  to  increase 
or  decrease  the  size  of  any  group  by  a  single 
line,  but  that  a  total  consolidation  into  a  prop 
erly  sized  band  5  group  is  the  optimal  solu¬ 
tion  This  is  the  general  problem  of  an  econ 
omy  of  scale  existing  but  not  manifesting  itself 
until  a  critical  size  is  reached  This  has  not 
turned  out  to  be  the  case  in  any  of  the  specific 
problems  examined  so  far.  but  it  is  something 
to  watch  out  for  One  protection  against  this  is 
to  start  the  optimization  off  first  with  a  balanced 
solution  and  then  with  an  all  band  5  solution 
This  is  what  is  done. 

4.  Experimental  Results 

In  order  to  better  understand  the  optimiza¬ 
tion  procedure  and  the  nature  of  the  solutions 
to  the  WATS  optimization  problem  under  the 
new  tariff,  a  set  of  experiments  was  run  The 
purpose  of  these  experiments  was  to  test  the 
behavior  of  the  cost  function  directly  Thus, 
instead  of  simply  running  the  optimization 
procedure,  we  selectively  enumerated  solu¬ 
tions  in  order  to  more  thoroughly  examine 
the  solution  space  Subsequently,  we  ran  the 
optimization  procedure  and  verified  that  it  did 
indeed  find  the  same  solutions 

Table  3  gives  the  salient  input  data  common 
to  all  experiments  The  time-of-day  profile  is 
assumed  to  be  id  tical  for  all  bands  (not  a 
realistic  assumption,  but  simple  to  input)  and 
is  given  in  Table  3A  Only  daytime  traffic  is 
considered,  again  for  simplicity.  Table  IB 
gives  the  cost  for  WATS  lines;  it  is  taken  from 
the  tariff  for  New  York  City  The  ODD  costs, 
also  estimates  for  New  York  City,  are  given  in 


Table  3.  Tlme-ot-Day  Profile 
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03 

Table  1C  Table  ID  gives  ti 

e  relative  costs  hv 

band  One  can  observe  that  these  factors 
relate  all  WATS  costs  and  DDD  costs  All  ex 
periments  were  run  with  22  engineering  days 
month 

The  first  experiment  had  1  F.rlang  day  of 
traffic  in  each  band  Table  4  summarizes 
some  of  the  configurations  explored  The 
configurations  in  Table  4  are  the  number  of 
lines  in  each  band  Commas  between  the 
numbers  are  omitted  when  they  are  single 
digit  numbers  First  note  that  the  pure  band  5 
solutions  are  unimodal;  i  e  .  they  decrease  in 
cost  initially  and  then  steadily  increase  in  cost 
as  the  number  of  lines  increases  This  is  not 
particularly  surprising  It  is.  however,  signifi¬ 
cant  as  an  incremental  procedure  like  the  one 
described  here  would  not  work  if  the  cost 
were  not  unimodal 

Starting  from  the  best  pure  band  5  solution  — 
00002  (Configuration  3).  the  best  solution 
( *9)  is  reachable  through  local  transformations 
via  the  configurations  3,  6.  and  9  Suppose, 
however,  that  after  Configuration  6,  Configu¬ 
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00000 
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0 
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0 
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00003 

2,153 

0 

0 

0 

0 

2080 

73 

5 
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ration  S  was  rcaclu-d  I  his  would  happen  i( 
we  tru'd  adding  a  lint*  m  hand  d  before  adding 
a  line  in  band  1  Configuration  9  could  still  be 
reached  via  the  path  8.  9.  i  e  by  rolling  one 
line  from  band  3  to  band  1  Configuration  9 
could  also  be  reached  from  a  balanced  solu¬ 
tion  like  Configuration  10  through  the  path 
10.  11.  and  h  Thus  we  see  that  Configuration 
9  is  reachable  :n  a  variety  of  ways  Trus  is  en¬ 
couraging  Vv'e  do  not  know  if  Configuration  9 
is  optimal 

While  we  cannot  be  sure  that  the  procedure 
will,  in  fact,  find  the  optimum,  the  data  in 
Table  4  is  very  encouraging  First,  we  see  that 
the  paths  to  the  best  solution  do  exist  Second 
we  see  that  the  solution  space  is  very  flat 
Most  dead  ends,  if  any  exist,  are  likely  to  be 
within  1  %  of  the  optimum 

Similar  conclusions  can  be  drawn  from 
Table  5A.  which  is  the  result  of  an  experiment 
with  10  Erlangs  per  day  per  band  Here  again, 
the  best  solution.  ("17),  is  reachable  from  the 
band  5  solution  and  balanced  configuration 
(r15)  Here  again,  both  the  band  5  and 
balanced  solutions,  and  hence,  any  dead  ends 
reachable  from  them,  are  within  1  %  of  the 
optimum  The  carried  load  and  cost/hour  are 
given  in  Table  5B  for  Configuration  17  They 
indicate  an  increasing  cost/hour  as  the  band 
increases  with  the  most  substantial  jump  be¬ 
tween  band  5  and  DDD  Other  experiments 
with  3  Erlangs,- day,' band  and  30  Erlangs - 
day/band  were  run  The  results  weie  similar 
to  those  reported  here 
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The  procedure  was  run  on  several  actual 
problems  using  data  derived  from  actual  call 
tapes  An  experienced  designer  then  attempted 
to  improve  the  solution  manually  using  the  :n 
teractive  system  within  which  the  procedure 
was  embedded  In  two  cases  an  enumeration 
was  run.  automatically  stepping  ihrough  all 
possible  solutions  In  no  case  was  any  im¬ 
provement  observed  Thus,  while  we  have  no 
formal  proof  that  the  procedure  converges  to 
an  optimal  solution,  this  empirical  evidence, 
togethei  with  the  essentially  unimodal  nature 
of  the  cost  function,  leads  us  .a  believe  that  it 
does  in  fact  yield  globally  optimal  solutions  in 
most  cases. 

5.  Extensions  and  Further  Conclusions 

The  multiple  hour  nature  of  the  problem 
does  not  seem  to  have  much  of  an  effect  cr 
the  procedure  If  an  off-peak  hour  has  traffic 
similar  to  the  peak  hour,  the  optimization 
should  be  unaffected  If  an  off-peak  hour  has 
much  less  traffic,  most  of  its  baffle  will  be  car¬ 
ried  m-bind  Neither  situation  seems  to  cause 
a  problem 

FX  and  tie  lines  could  be  handled  by  mcreas 
ing  the  number  of  rows  in  the  traffic  matrix  to 
include  candidate  FX  and  tie  lines  Similarly, 
the  procedure  could  be  extended  to  handle  a 
more  general  class  of  overflow  rules  where 
traffic  horn  one  band  is  restricted  to  overflow 
onlv  to  specific  other  bands  In  some  cases, 
where  the  number  of  overflow  options  is 
limited,  an  explicilv  routing  table  would  have 
to  be  enteied 

Queueinu  and  blocking  solutions  could  be 
produced  hv  replacin';  the  I  thing  B  formula 
h\  one  .’.ppiopi  site  1<  •  the  situ  lhon  7  fie  con 
<1  a-.  I'.i.uve  tmm  the  experiments  ah'  ve 
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the  Erlany  B  function  They  nro  thus  likclv  to 
hold  m  the  other  situations  as  well 
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ABSTRACT 

Ve  address  the  problem  of  designing  a  circuit* 
switched  network  for  voice  communications  operating  in  a 
military  environment.  The  circuit-switched  network 
design  problem  may  be  briefly  stated  as:  given  the 
topology,  route  tables  and  control  disapline,  end-to-end 
offered  traffic  and  performance  requirements,  determine 
trutk  group  sizes  such  that  the  requirements  are  satisfied. 
One  of  the  key  requirements  of  a  design  is  that  the 
network  be  survivable,  where  survivability  is  based  on 
different  destruction  scenario  conditions.  Our  objective 
Is  to  guarantee  an  acceptable  level  of  performance  for 
every  rode  pair  and  under  each  of  the  different 
anticipated  damage  scenarios.  The  main  contributions  of 
our  present  work  are  the  development  of  approaches  for 
designing  networks  tliat  simultaneously  satisfy 
performance  requirements  for  different  destruction 
scenarios. 

We  describe  the  architecture  of  a  survivable,  circuit- 
switched  network.  The  key  characteristics  of  the 
survivable  network  design  problem  are  highlighted  and 
differences  with  respect  to  classical  trunk  sizing  problem 
are  pointed  out.  An  important  aspect  of  our  work  is  that 
the  sizing  is  based  on  the  logical  topology  of  the  network 
rather  than  its  trunk  group  topology.  One  design 
approach,  which  we  have  used  successfully,  is  presented  in 
detail. 

1.  WTROOUCTION 

In  this  paper,  we  address  the  problem  of  designing  a 
circuit-switched  network  for  voice  communications 
operating  in  a  military  environment.  The  circuit-switched 
network  design  problem  may  be  briefly  stated  as:  given 
the  topology,  route  tables  and  control  discipline,  end-to- 
end  offered  traffic  and  performance  requirements, 
determine  trunk  group  sizes  such  that  the  requirements 
are  satisfied.  One  of  the  key  requirements  of  a  design  is 
that  the  network  be  survivable,  where  survivability  is 
based  on  different  destruction  scenario  conditions. 
Survivability  constraints  on  a  network  design  include:  a) 
connectivity  requirements,  and  b)  end-to-end  performance 
requirements. 

Satisfying  connectivity  requirements  is  a  classical  and 
relatively  well  studied  problem.  The  main  contributions 
of  our  present  work  are  the  development  of  approaches 
for  designing  networks  that  simultaneously  satisfy 
performance  requirements  for  different  destruction 
scenarios.  Our  objective  is  to  guarantee  an  acceptable 
level  of  performance  for  every  node  pair  and  under  each 
of  the  different  anticipated  damage  scenarios.  This 
objective  is  more  general,  and  in  our  opinion  more 
appropriate,  than  the  goal  of  ensuring  connectivity 


requirements.  We  avoid  the  pitfall  of  designing  networks 
that  satisfy  connectivity  constraints,  but  exhibit 
unacceptably  poor  performance.  An  important  aspect  of 
our  work  is  that  the  design  is  based  on  the  logical 
topology  of  the  network  rather  than  its  trunk  group 
topology. 

The  optimal  design  of  circuit-switched  networks  for 
voice  communications  is  complicated  due  to  a  variety  of 
reasons.  In  order  to  appreciate  this  fact,  we  first  review 
the  salient  characteristics  of  military  networks  that 
distinguish  it  from  commercial  telephony.  They  typically 
have  a  mesh  connected  topology  (switch  interconnections) 
and  use  non-hierarchical  alternate  routing  schemes. 
Other  important  operational  features  that  are  uniaue  to 
military  networks  include  the  presence  of  multiple  levels 
of  traffic  priority,  preemption  of  calls,  provision  for 
multiple  route  search  methods  (normal,  preemptive,  and 
hotline).  The  main  End-To-End  (ETE)  performance 
measures  of  interest  to  a  netwc,  k  designer  are:  Grade  Of 
Service  (GOS),  Probability  Of  Preemption  (POP),  and 
Speed  Of  Setup  (SOS)  of  calls. 

In  this  paper,  we  focus  attention  on  effective  design  of 
networks  that  satisfy  ETE  GOS  performance  requirements 
for  multiple  scenarios  (prespecified  by  the  user).  In  the 
next  section,  we  introduce  relevant  terminology  and 
explain  the  topological  representation  of  the  network.  In 
Section  3,  the  key  characteristics  of  the  survivaoie 
network  design  problem  are  pointed  out.  In  the  final 
section,  we  develop  approaches  for  designing  survivable 
networks.  One  such  approach,  which  we  have  used 
successfully,  is  presented  in  detail. 

It  is  worth  noting  that  recent  literature  [1]  and  [2] 
contain  approaches  to  the  dimensioning  of  networks  for 
more  than  one  busy  hour.  These  multi-hour  sizing 
algorithms  address  a  problem  conceptually  similar  to  ours, 
in  the  seme  that  requirements  of  each  busy  hour  may  be 
thought  of  as  a  scenario.  However,  the  methods  are 
largely  inapplicable  to  us  because  they  consider  networks 
(commercial  telephony)  with  hierarchical  routing,  while 
our  interest  is  in  dimensioning  networks  with  non- 
hierarchical  routing. 

2.  TOPOLOGICAL  REPRESENTATION 

In  this  section,  we  fix  relevant  terminology  and 
concepts.  We  also  elaborate  briefly  on  the  architecture 
of  a  typical  survivable  voice  communication  network  and 
set  the  framework  for  subsequent  sections  of  the  paper. 
Our  discussion  will  focus  on  the  backbone  non-hierarchical 
switched  part  of  the  overall  communications  system. 
Users  are  allowed  to  be  connected  via  dedicated  local 
loops  to  the  switch  sites  anti  routed  via  the  backbone 
switcned  network  to  the  destination  handset. 


The  backbone  topology  of  a  survivable  network  is  deal: 
at  three  level:  of  detail.  These  are:  J)  physical  topology, 
2)  logical  topology,  and  3)  trunk  group  topology.  A 
physical  topology  it  comprised  of  a  set  of  physical  sites 
(e.g.,  switch  sites,  radio  relay  stations),  and  physical  links 
(e.g.,  microwave  links)  as  depicted  in  Figure  1.  The 
physical  elements  (sites  and  links)  are  the  targets  of 
destruction  under  enemy  attack. 
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FIGURE  1:  PHYSICAL  TOPOLOGY 


FIGURE  2:  LOGICAL  T0?0L0GT  AND  LOGICAL 
LINK  COMPOSITION 


A  logical  link  in  a  network  is  a  path  in  the  physical 
network  between  two  switch  sites  without  intervening 
switching.  A  logical  link  is  considered  operational  (i.e., 
not  destroyed)  if,  and  only  if,  all  the  physical  elements 
composing  it  are  operational.  A  oair  of  switches  may  be 
connected  by  more  than  one  logical  link.  If  such  is  the 
case,  the  logical  links  represent  conceptually  different 
communication  paths  between  the  two  switcnes.  .1 
multiple  logical  links  exist  between  a  switch  pair,  there 
may  be  physical  elements  (other  than  the  terminal  switch 
sites)  common  to  two  or  more  ol  the  logical  links.  The 
logical  topology  of  a  network  consists  of  a  set  of  switch 
sites  and  logical  links  connecting  site  pa,rs. 

In  Figure  2,  we  show  a  logical  topology  of  the  network 
whose  physical  topology  is  given  in  Figure  1.  There  we 
•everal  potential  logical  topologies  one  can  define  w.tn 
respect  to  the  g  ver  physical  network  topology,  and 
Figure  2  shows  just  one  of  the  choices. 


A  trunk  group  between  two  switch  sites  consists  of  all 
the  logical  links  that  connect  the  site  pair.  II  no  logical 
link  connects  a  site  pair,  then  there  exists  no  trunk  group 
between  the  site  pair.  The  trunk  group  topology  of  a 
network  consists  of  a  set  of  switch  s.tes  and  trunk  groups 
connecting  site  pairs  as  shown  in  Figure  ).  A  route 
between  a  switch  pair  is  specified  as  a  sequence  of  trunk 
groups. 

Connectivity  requirements  on  a  design  stipulate  that 
for  each  switch  pair,  there  must  be  at  least  a  prespecitied 
number  of  physically  disjoint  paths  in  me  physical 
topology  connecting  the  switches.  Connectivity 
constraints  are  tooolcgicai  in  nature  and  are  taken  mtc 
account  in  selecting  tne  logical  topology  and  routes  lor 
u'iffeier.t  switch  pairs. 


FIGURE  3:  TRUNK  GROU°  TOPOLOGY 


Associated  with  each  physical  link  is  its  maximum 
capacity.  It  is  expressed  as  the  number  cl  telephone  caUs 
that  the  link  can  transmit  at  the  same  time.  The  size  (s. ) 

of  logical  link  /  is  the  number  ol  trunks  it  contains,  and 

cannot  exceed  the  capacity  ol  minimum  ol  component 
physical  links  in  it.  The  size  ol  a  trunk  group  is  tne  sum 
ol  the  sizes  ol  logical  links  that  comprise  it.  Figures  1 
through  3  indicate  the  sizes  ol  a  hypothetical  design. 

The  oost  c  (doUars/month)  ol  logical  Jink  /.  consists 

ol  rvo  components:  1)  a  lixed  cost  lb.)  lor  establishing 

the  logical  link,  and  2)  a  variable  cost  which  is 
proportional  to  its  size.  11  the  cost  per  circuit  in  is  a^, 

the  total  cost  c.  is  equal  to  (b^  ♦  a^s.).  The  specilic  choice 

ol  a  logical  topology  determines  the  lixed  costs  ol  the 
network.  The  total  cost  ol  the  network  is  the  sum  cl  its 
lixed  costs  and  variable  costs.  The  design  proolem, 
considered  in  the  subsequent  sections,  is  concerned  with 
the  determination  ol  a  leas.ble  set  ol  logical  link  sizes 
such  that  the  variable  costs  (i.e.,  trunk  costs)  are 
minimized.  Although  not  addressed  in  this  paper,  the 
designer  interested  in  m.nimizing  the  total  cost  ol  the 
network  must  consider  the  choice  ol  logical  topology  as 
part  ol  the  overaU  design  problem. 

3.  DESIGN  PROBLEM 

The  classical  network  design  problem  may  be  slated  as 
loUows.  Given  the  trunk  grouo  topology,  offered  ETE 
trallic  and  performance  'GOS)  requirements,  >sA  routes 
and  route  control  discipline,  determine  a  set  oi  trunk 
group  sizes  such  that  traffic  and  performance 


requirements  are  satisfied.  The  a.'sove  problem, 
(henceforth  referred  to  ;n  this  paper  a:  :he  Trur>K  5i..ng 
Problem)  requires  the  siz.ng  of  trunk  groups  for  a  smric 
scenario.  A  scenario  is  an  operating  condition  of  tie 
network  characterized  oy  its  unique  set  ol  traffic  and 
performance  requirements. 

The  classical  Trunk  Sizing  Problem  in  networks  with 
non-hierarchical  routing  has  been  weU  studied  in  the 
literature  [3]  and  [<•]  .  Segal's  [3]  algorithm  is  based  on 
single-moment  assumption  (i.e.,  calls  offered  to  each 
trunk  group  have  a  Poisson  distribution)  and  is  applicable 
lor  networks  with  e'ther  originating  office  control  or 
successive  office  control.  Covo  considered  the 

presence  ol  multiple  precedence  levels  of  traffic  and 
developed  an  approximate  method  for  sizing  trunk  greups. 
However,  Covo’s  work  aodresses  the  requirements  of  only 
a  single  scenario.  For  our  computational  studies,  we  used 
Segal's  algorithm  1  or  feasible  trunk  greup  sizing.  »e  no 
not  repeat  Segal's  algorithm  here,  but  refer  the  reader  to 
[3]  lor  details. 

The  problem  ol  designing  a  survivable  network 
(henceforth  referred  to  as  SND  Problem)  requires  a  sizing 
ol  trunk  groups  and  logical  links  that  satisfies  the  traffic 
and  performance  requirements  under  multiple  operating 
conditions  ol  the  network.  This  is  the  most  important 
characteristic  that  distinguishes  the  problem  from  the 
Trunk  Sizing  Problem.  Another  unique  characteristic  cl 
the  SND  problem  is  the  dichotomy  between  trunk  group 
sizing  and  logical  link  sizing.  The  key  aspects  of  this 
distinction  between  trunk  group  sizing  ar.d  logical  link 
sizing  are  as  follows.  In  classical  studies,  a  physical  link, 
is  synonymous  with  a  trunk  group  and  destruction  of  a 
physical  link  or  a  switch  would  result  m  the  tofai 
destruction  of  any  route  that  uses  it.  However,  ler  a 
survivable  network,  the  effect  ol  destroying  a  physical 
element  (lir.k  or  site)  has  the  following  ramifications.  All 
logical  links  that  use  the  physical  element  are  cestroyed. 
The  loss  ol  a  logical  link  would  resuit  in  reducing  the  size 
ol  the  trunk  group  that  contains  the  logical  link.  Thus, 
the  size(s)  ol  one  or  more  trunk  group(s)  will  be  reduced. 
The  effect  on  a  route  affected  by  the  physical  element 
destruction  is  one  ol  the  following:  1)  a  reduction  in  size 
ol  some  trunk  group(s)  belonging  to  the  route,  or  2)  total 
destruction  of  the  route.  The  latter  occurs  when  all  the 
logical  links  composing  any  trunk  group  beiong.ng  to  fhe 
route  are  destroyed.  Unlike  the  classical  Trunk  Sizing 
Problem,  the  loss  of  a  physical  element  does  not 
necessarily  imply  the  total  destruction  ol  one  or  more 
routes. 

We  illustrate  the  above  facts  with  an  example. 
Consider  the  network  whose  logical  topology  2nd  icgicai 
link  composition  are  shown  in  Figure  2.  The  impact  ol 
destroying  relay  site  (see  Figure  1)  would  be  to  oestrov 

logical  links  Lg  and  that  use  R-j  (see  logical  link 

composition  in  Figure  2).  These,  in  turn,  imply  a 
reduction  in  size  of  trunk  group  T^  (containing  from 

32  to  20.  It  can  be  observed  that  size  of  trunk  group  T. 

(containing  L^)  is  reduced  from  33  to  23,  while  trunk 

group  T^  (containing  a  single  logical  link  L^)  is  totally 

destroyed  (size  reduced  to  zero).  Thu-,  the  only  routes 
totally  disrupted  would  be  those  containing  T^,  wrule  the 

rest  of  the  routes  are  likely  to  operate  at  a  derraced 

performance  level. 

For  a  given  scenar.o,  the  ETC  GOS  performance  bf  a 
network  is  a  function  of  the  trunk  gre.b  topology,  trunk 
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group  iizrs,  routes,  and  T.TF.  ottered  traffic  of  that 
scenario.  Honover,  :n  designing  a  su-vivable  network,  we 
have  to  size  the  log. cal  Inks  such  tlwit  (or  each  scenario, 
the  trunk  group  sizes  (as  determined  by  the  surviving 
logical  links)  are  adequate  to  meet  the  requirements  c.1 
the  scenario. 

The  specification  of  the  design  problem  must  precisely 
characterize  the  scenarios  of  interest.  .Vs  an  example, 
our  interest  is  in  sizing  a  network  with  the  following 
requirements.  Under  a  normal  scenario  in  which  no 
network  elements  are  destroyed,  the  network  must  be 
capable  of  satisfying  a  nominal  set  of  offered  traffic  and 
performance  requirements.  The  performance  requirement 

is  G  ,  the  maximum  tolerable  eno-to-end  GOS.  The 
max 

value  of  G  could  be  0.01.  Under  an  overload  scenario, 
max 

no  network  elements  are  destroyed,  but  the  offered 
traffic  between  each  node  pair  increases  to  some  c  (say, 
C  —  1.5)  times  the  nominal  offered  traffic  between  the 
pair.  The  network  sizing  must  satisfy  the  overload  traffic 
requirement  at  a  less  stringent  performance  level  (say, 

G  a  0.10).  In  addition,  we  require  that  whenever 
max 

exactly  one  physical  element  is  destroyed,  the  surviving 
network  must  be  capable  of  handling  the  nominal  traffic 
requirements  at  a  degraded  performance  level  (say, 

G  =  0.15).  We  note  that  for  the  physical  network 
max 

shown  in  Figure  1,  there  are  21  physical  elements 
(6  switches,  &  relays,  and  1 1  links)  and  corresponding  to 
each  element  a  damage  scenario  is  identified. 

In  the  next  section,  we  discuss  rwo  basic  design 
philosophies  for  realizing  feasible  and  near  minimum  cost 
survivable  network  designs. 


ll  <  i  <  si,  identify  the  trunk  grout  top-  l.-gv.  ..vmr 

mures,  ETE  offered  traffic,  a-t  nerfn"  ■  r-.r» 
requirements.  Apply  Segal's  algorithm  [3]  jr.a  com;-  r 
feasible  set  of  trunk  group  sizes  satisfying  the  ^_TE 
requirements  of  scenario  |.  Let  a(.  be  the  oesired  size  for 

trunk  group  i  (I  <  i  <  t.  where  t  is  the  total  nurncer  of 
trunk  groups  under  the  normal  scenario)  in  scenario  |. 

Step  2  requires  the  calculation  ol  loemal  line  sizes 
based  on  the  trurk  group  sizes  (computed  m  Step  I)  to 
that  the  total  monthly  dollar  costs  are  minimized. 

We  devote  the  rest  of  this  Section  '«.!)  to  address 
Step  2.  Consider  a  specific  trunk  group,  sav  i.  Let  the 
trunk  group  contain  m  logical  links  and  let  n  ^  represent 

the  number  of  circuits  in  logical  link  k  of  trunk  grc  c  i. 
Further,  let  Xj^  be  a  binary  variable  defined  as  follows: 


if  logical  link  k  of  trunk  group  i 
destroyed  in  scenario  j 

(I  <  j<s,  1  <  k  m) 

ll  otherwise 


X'ik 


Let  b.^  be  the  cost  per  circuit  in  logical  link  k  in  the 

trunk  group  i.  The  minimum  cost  logical  link  assignment 
(MCLLA)  problem  is  stated  as  follows: 


Minimize: 
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4.  DESIGN  APPROACHES 


We  consider  two  basic  philosophies  for  sizing 
survivable  networks.  We  assume  the  follow, ng  are  given  - 
physical  topology,  logical  topology  and  logical  link 
composition  (under  normal  scenario),  routes,  physical  link 
sizes,  a  precise  characterization  of  the  scenarios  and 
their  requirements.  The  objective  is  to  compute  logical 
link  sizes  such  that  the  requirements  of  each  scenario  are 
satisfied.  Two  basic  design  approaches  we  considered  are: 

1.  Parallel  Approaches. 

2.  Serial  Approaches. 

4.1  Parallel  Approaches 

Suppose  a  network  requires  siztng  of  logical  links  and 
trunk  groups  to  satisfy  traffic  and  performance 
requirements  for  s  different  scenarios.  The  basic 
philosophy  of  a  parallel  approach  is  that  a  feasible 
survivable  design  may  be  obtained  by  independent 
solutions  of  many  instances  of  the  trunk  sizing  problem 
(one  instance  fer  each  scenario)  and  then  an  appropriate 
combination  of  the  solutions  to  determine  the  logical  link 
sizes.  Thus,  the  two  ma|or  steps  in  the  parallel  approach 
are: 

Step  Is  Determining  Trunk  Group  Size 
Requirements. 

Step  2:  Assign  Capacity  to  Logical  Links  to  Satisfy 
Trunk  Group  Size  Requirements. 

Step  1  is  accomplished  as  follows.  For  each  scenario  j 


Subject  to:  m 

E  x)‘k  n.u- a  1  «  j  -c  s,  2  <  i  «  t  (2) 
k=l  ‘  ,J  ~ 

4±nikiu,k’  1ik<-m  (3) 


The  objective  function  (1)  represents  the  total  cost  of 
circuits  in  ail  logical  links  of  the  given  logical  topology. 
There  are  s  iinear  inequalities  in  (2)  for  each  trunk 
group  i,  and  their  interpretation  ts  as  follows.  For 
scenario  j  and  trunk  group  i,  the  total  r.mber  of  circuits 
that  survive  (given  by  the  left  side  of  the  .necuaiity)  mutt 
be  at  least  equal  to  a^,  the  ces.red  size  lor  trunk  group  i 

under  scenario  j.  The  constraints  on  the  Oesign  problem 
may  mandate  a  minimum  size  requirement  i/^l,  anc 


maximum  size  limn  (u^)  for  each  logical  Link  k.  These 

constraints  are  given  by  (3). 

In  addition  to  constraints  (2)  and  (3)  that  are  scecific 
to  trunk  g-oup  i,  there  are  constraints  on  the  number  of 
circuits  that  oan  be  used  in  a  physical  link.  If  j  numoer  of 
logical  links  (possibly  belonging  to  many  different  trunk 
groups)  use  a  pnysica!  link,  me  comomed  circuit 
requirement  of  the  logical  links  using  the  physical  link 
cannot  exceed  the  maximum  specified  limit  for  the 
physical  link.  These  constraints  must  be  handled  bv  the 
logical  link  assignment  algorithm. 

The  MCLLA  prob’em  formulated  ibovc  couid  be  ap¬ 
proach  ed  via  an  integer  linear  prcg'.imming  'ILF! 
algorithm.  However,  ramer  thm  acp:..-.g  a  computa¬ 
tionally  expensive  ILF  altmuthm,  we  >u,.e  cevelrp'-c  me 
following  heuristic  algorithm  to  acruevc  a  >ne„rf  m.nimurr. 
cost  design. 


-  m 


i  '1 


u 


» I 


Our  approach  is  based  on  decomposition  of  objective 
function  (1)  into  t  separable  components  (one  for  each 
trunk  group)  and  minimizing  each  component  incividually. 
The  t  trunk  groups  are  assigned  to  their  respective  logicai 
links  by  considering  them  sequentially.  Suppose  ej,  e,,..., 

e^  Is  a  sequence  in  which  tl>e  trunk  groups  are  considered 

for  logical  link  assignment.  When  the  logical  links  of 
trunk  group  e^  ( 1  <  i  <_t)  are  assigned,  the  sizing  decisions 

made  earlier  for  logical  links  of  trunk  groups  e^ 
ej,...,e^  j  will  be  treated  as  constraints.  These 

constraints  arise  due  to  the  fact  that  (i)  physical  links 
may  be  shared  by  logicai  links  belonging  to  different  trunk 
groups  and  (ii)  upper  bounds  on  size  of  physical  links. 

An  outline  of  the  steps  of  the  MCLLA  assignment 
algorithm  is  given  below. 

Step  a:  Choose  a  sequence  in  which  trunk  groups  are 
to  be  considered  for  logical  link  assignment. 
Without  loss  of  generality,  let  this  sequence 
be  1,2,...  ,t. 

Step  b:  1*1. 

Step  ct  Assign  circuits  to  logical  links  of  trunk 
group  i. 

Step  d:  i  *  1  ♦  1. 

Step  es  If  i  >t  then  stop;  else  repeat  Step  c. 


2.  The  maximum  limit  on  the  capacity  of  some 
physical  link  p  in  logical  link  is  reached  or  the 

upper  bound  on  the  size  of  is  reached. 

If  condition  (1)  becomes  true,  then  we  are  done  with 
the  assignment  meeting  tlie  requirements  cf  scenario  j.  If 
condition  (2)  becomes  true,  then  the  remaining  circuit 
requirements  of  scenario  j  are  fulfilled  by  considering  the 
second  least  expensive  logicai  link/,  that  survives  in 

scenario  j.  Ii  necessary,  the  algorithm  also  considers  tne 
third  least  expensive  logicai  Jink,  and  so  forth. 

Fcr  the  purpose  of  illustration  of  Step  c.  consider  the 
following  example.  Let  a  trunk  group  have  m  (=4)  logical 
links  which  must  be  sized  to  meet  the  trunk  group's 
requirements  in  s  (=5)  scenarios.  Let  the  required  sizes  of 
the  trunk  group  be:  aj  =  59,  aj  =  57,  a^  i  61,  a^  =  40, 

=  85.  Let  the  cost  per  circuit  in  the  logical  links  be: 

bj  s  10,  b^  -  20,  bj  a  8,  b^  =  12.  Let  the  upper  bound  for 

each  logical  link  be  50  and  the  lower  bound  for  each  be  0. 
Let  X,  shown  in  Table  1,  be  a  5  x  4  matrix  such  that  the 

(j,k)1^  entry  is  1  (or  0)  depending  on  whether  logical  link 
k  (1  <  k  <  4)  survives  (or  Jails)  in  scenario  j  (1  «  j  <  5).  The 
logical  link  sizes  (1  <  k  <  4)  are  computed  as  loliows: 

TABLE  1:  MATRIX  X 

LOGICAL  LINK  K 


MCLLA  Algorithm 


Step  a  of  the  MCLLA  algorithm  is  accomplished  as 
follows.  There  is  a  large  number  (t!)  of  possible  sequerce, 
in  which  the  trunk  groups  could  be  considered  for  logicai 
link  assignment.  Since  the  final  solution  (total  cost  of 
logical  link  circuits)  depends  on  the  sequence  in  wruch 
trunk  groups  are  considered,  we  consider  trunk  groups  in  a 
sequence  determined  by  decreasing  order  of  their  size 
requirement.  For  each  trunk  group  i(l  <  i  <  t),  compute 
the  variable  hj  (highest  requirement  of  trunk  group  i  over 

all  scenarios). 

h.  a  max  fa../ 

1  1  <  j  _s  i") 


Sort  the  h.'s  in  non-increasing  order.  Consider  the  tnnk 

groups  for  logical  link  assignment  in  the  sequence 
determined  by  the  above  sort  step. 

We  now  consider  Step  c  in  the  MCLLA  algorithm  and 
describe  how  the  algorithm  works  in  meeting  the  size 
requirements  of  a  specific  trunk  grouo  i.  The  algorithm 
first  considers  scenarios  in  whicn  oniy  one  logical  link  of 
trunk  group  i  survives,  then  it  considers  scenarios  in  winch 
two  logical  links  survive,  and  so  forth.  In  assigning 
circuits  to  fulfill  the  requirements  of  scenario  j,  suppose 
there  is  a  choice  to  assign  circuits  among  two  or  mere 
logical  links.  The  assignment  algorithm  assicns  circuits 
first  to  the  least  expensive  (per  circuit  cost)  logical  link, 
/j  in  that  scenario.  The  assignment  is  mace  untii  either 

of  the  following  two  conditions  becomes  true: 

I.  All  the  circuit  requirements  of  scenario  j  have 
been  fulfilled. 


First,  scenario  4,  with  only  one  survivmg  logical  link 
(Z^)  is  considered.  Its  size,  n^,  is  set  to  40,  the  requirec 

trunk  group  size  in  scenario  4.  Of  the  remain. ng 
scenarios,  scenarios  1,2,  and  3  have  two  surviving  logical 
links  each  and  scenario  5  has  all  four  icgica!  links  in  tac*. 
The  constraint  on  scenario  I  is  that  logical  links  ./,  and^ 

be  sized  such  that  their  combined  size  (n,  -  n^)  is  >  59. 

Since  is  cheaper  than  circuits  are  assigned  to  /j 

without  violating  the  upper  bound  constraint  on  .ts  size. 
Hence,  n,  is  set  at  the  maximum  permissible  value  of  50. 

Since  th'S  falls  short  of  the  required  size  by  9  circuits,  the 
value  of  nj  is  set  to  9.  For  scenario  2,  (nj  ♦  n^)  must  be 

>57.  The  algorithm  first  considers  which  is  cneaoer 

than  /,.  Since  /,  is  at  its  maximum  permissible  size. 

circuits  are  assigned  to  the  second  least  exDe-isive  logical 
Lin*,  n^  is  set  to  7.  The  algorithm  proceecs  ..i  a 

similar  manner  and  ends  up  with  the  tcliov  mg  logical  link 
sizes: 

nj  =  21,  rij  =  7,  *  50,  n^  =  4C. 


*.2  S<~  ial  Approaches 


The  basic  philosophy  of  a  serial  approach  is  that  a 
survivable  network  design  may  be  obtained  by 
considering  the  requirements  of  each  scenario  in 
sequence.  The  initial  goal  is  to  identify  the  scenario 
whose  requirements  are  the  most  dominant  (according  to 
a  criterion,  well-defined  by  the  designer)  over  ?il  me 
scenarios.  The  trunk  groups  are  ci..ed  and  the  required 
trunk  group  circuits  are  apportioned  to  the  logical  unks 
that  survive  in  the  most  dominant  scenario.  When  the 
requirements  of  a  nor>-dominant  scenario  j  are 
considered,  a  baselire  cesign  is  first  oerived  from  a 
knowledge  of  sizes  of  logical  links  dimensioned  prior  to 
scenario  j.  The  requirements  of  scenario  ]  are  sought  to 
be  fulfilled  by  incremental  modifications  with  respect  to 
the  baseline  design. 

Despite  the  intuitive  appeal  of  the  serial  approach  in 
designing  networks  with  nearly  "similar"  scenarios,  there 
are  a  number  of  disadvantages  to  it.  Cur  assessment  is 
that  the  accuracy  of  the  serial  approach  is  quit* 
sensitive  to  the  sequence  in  which  scenarios  axe 
considered.  Furthermore,  the  accuracy  depends 
significantly  on  the  use  of  effective  algorithms  for 
incremental  sizing  (from  one  scenario  to  tne  nest). 
Effective  algorithms  for  marginal  sizing  are  known  in 
the  context  of  hierarchical  networks  [5j ,  though  not  as 
much  is  knewn  for  the  ricn-hieraro.ica.  routing  network's 
of  interest  to  us.  A  third  and  very  significant 
disadvantage  of  the  serial  approach  is  its  extremely  high 
computational  time  requirement  when  compared  to  the 
parallel  approach.  W'-en  a  baseline  cesign  is 
incrementally  modified  to  meet  the  requirements  cf  a 
new  scenario,  the  ETE  grade-of-service  must  be 
computed  to  verify  if  indeed  the  requirements  have  been 
satisfied.  Thus  the  serial  aoproacn  requires  many 
computationally  expensive  performance  analysis  runs  as 
part  of  its  incremental  sizing  sneps.  From  our 
computational  experience,  we  have  found  the  parallel 
approach  to  yield  good  feasible  designs  within  reasonable 
computer  time. 
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Delay  and  Overhead  in  the  Encoding  of  Data  Sources 

J  E  HAYES.  shNiofc  mi mbf-k.  im.  and  R  R  BOORSTYN  m  bf-r  ,  ieee 


Abstract— A  basic  properfv  of  data  sources  in  interactive  appli¬ 
cations  is  hurstiness.  i.e.,  short  periods  of  activity  followed  by  Ion* 
idle  periods.  In  these  same  applications  message  delay  is  the  primary 
performance  criterion.  The  combination  of  bursty  How  and  a  delay 
criterion  leads  to  a  source  encoding  problem  in  which  delay  plays  a 
central  role.  A  salient  feature  of  this  problem  is  that  there  is  a  tradeoff 
between  delay  and  the  number  of  protocol  bits  required  to  represent 
the  state  of  the  source. 

A  simple  model  of  a  bursty  source  is  studied  with  the  objective  of 
understanding  the  relationship  between  coding  efficiency  and  delay. 
Two  encoding  schemes,  a  block  encoding  technique  and  a  technique 
employing  flags,  are  examined  in  some  detail,  tor  both  the  block 
encoding  and  the  flag  schemes,  a  significant  result  is  that  as  the  source 
becomes  less  bursty,  delay  grows  without  bound.  This  result  is 
obtained  in  spite  of  the  fact  that  both  schemes  are  reasonable  and  in 
the  limit  the  encoding  problem  disappears.  It  also  appears  that  (he 
flag  encoding  technique  has  much  smaller  delay  than  block  encoding. 

INTRODUCTION 

THIS  paper  is  concerned  with  the  interplay  of  delay  and 
overhead  in  the  transmission  of  data  traffic  over  a  syn¬ 
chronous  channel.  The  relationship  of  these  quantities  is  due 
to  certain  unique  properties  of  data  in  an  interactive  environ¬ 
ment.  Data  traffic  is  bursty,  i.e.,  there  are  short  busy  periods 
interspersed  with  relatively  long  idle  periods  which  contain  no 
information.  (This  stands  tn  contrast  to  voice  and  video, 
where  pauses  or  blanks  may  be  significant.)  The  second  rele¬ 
vant  property  of  data  is  the  effect  of  delay.  In  many  applica¬ 
tions  data  traffic  is  highly  interactive,  placing  a  premium  on 
rapid  response  tunes.  The  relationship  of  these  quantities  al¬ 
lows  one  to  formulate  a  source  encoding  problem  which  con¬ 
siders  delay  as  a  basic  component.1 

Clearly,  the  long-term  average  rate  of  data  generation  at  the 
source  must  be  less  than  the  capacity  of  the  synchronous 
channel.  However,  long-term  averaging  will  lead  to  a  violation 
of  the  delay  requirement  and  a  reduced  encoding  interval  is 
indicated.  Short-term  encoding  leads  to  increased  overhead 
since  there  will  be  times  when  there  is  nothing  to  transmit  a 
source  state  which  must  be  conveyed  to  the  receiver  unambig¬ 
uously.  In  the  sequel  the  problem  of  striking  a  balance  be¬ 
tween  overhead  and  delay  is  studied  for  a  basic  source  model. 
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To  a  large  extent  the  role  of  delay  in  this  context  has  not 
been  considered  by  information  theorists.  The  earliest  related 
work  was  done  by  Jelinek  [2 )  and  was  continued  in  collabora¬ 
tion  with  Schneider  (3),  (4) .  The  model  we  consider  is  de¬ 
picted  in  Fig.  1.  A  source  produces  symbols  at  a  fixed  rate. 
If  the  symbols  produced  by  the  source  are  not  equally  prob¬ 
able.  then  by  using  variable  length  codes,  such  as  Huffman 
codes,  the  average  number  of  bits  that  need  to  be  transmitted 
over  the  channel  in  a  time  interval  can  be  minimized.  In  order 
to  implement  this  technique  buffering  is  required  since  the 
channel  operates  at  a  fixed  rate  and  the  bits  are  produced  by 
the  encoder  at  a  variable  rate.  Jelinek  and  Schneider  studied 
the  relationship  between  the  probability  of  buffer  overflow 
and  efficient  source  encoding. 

We  shall  consider  a  simple  model  of  a  source  which  em¬ 
bodies  these  basic  quantities.  Several  techniques  for  encoding 
this  source  are  studied  with  the  focus  upon  delay  rather  than 
storage.  A  particular  technique  using  flags  is  examined  in 
detail.  Related  work  on  the  topic  has  been  done  by  Gallager 

[5] ,  who  applied  rate  distortion  theory  to  the  coding  prob¬ 
lem  with  delay  as  the  distortion  measure.  Recently  Humblct 

[6]  has  considered  the  problem  of  encoding  randomly  arriving 
messages  so  as  to  minimize  delay. 

SOURCE  MODEL 

Consider  the  following  model:  a  source  produces  either  a 
blank  or  a  binary  digit  once  every  second  with  the  following 
probabilities. 

P  [blank]  =  1  —  P 

P  [0  output]  =P/2  (1) 

/’ll  output)  =  P/2. 

Successive  symbols  are  independent.  The  sequence  of  0's  and 
1  s  produced  by  this  source  is  to  be  carried  over  a  synchronous 
error-free  binary  channel  with  a  capacity  of  one  bit  per  second. 
It  is  assumed  that  blanks  are  of  no  interest  to  the  information 
sink  at  the  receiver.  This  source  produces  data  at  an  average 
rate  of  P  bits/s.  Thus  for  P  <  1  there  will  be  times  when  the 
source  has  nothing  to  transmit,  and  this  condition  of  the 
source  must  be  conveyed  over  the  channel  by  means  of  over¬ 
head  bits  transmitted  along  with  information  bits.  As  we  shall 
see.  the  number  of  these  overhead  bits  that  are  required  is  a 
function  of  the  allowable  delay. 

An  example  of  the  relationship  between  delay  and  the  over¬ 
head  information  is  given  by  the  following  encoding  proce¬ 
dure  All  of  Ihe  source  outputs  in  an  /  second  interval  are  en¬ 
coded  m  a  block.  In  each  I  second  interval  the  information 
shat  is  transmitted  is  the  number  of  data  hits  generated  by  the 
soul <.e  and  then  values  It  more  than  /  bits  are  required  the 

i7s StH'  7'  i-j  I "81  III  f; 
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Klg.  1.  Source  en^uJinc  model 


excess  is  buffered.  If  this  requires  less  than  /  bits,  zeros  can  be 
transmitted  in  the  slack  interval  remaining  aftei  the  buffered 
bits  with  no  ambiguity.  Since  the  position  of  blanks  in  an  / 
second  interval  is  of  no  interest,  the  number  of  actual  data 
bits  produced  by  a  source  in  an  /  second  interval  follows  a 
binomial  distribution.  The  entropy  of  the  source  over  an  I 
second  interval  is 


//(/,  P)=  PI-  j  b(i-  /,  P)  log2  b(r,  I.  P) 


t*  0 


where 


b(iJ,P) 


Pi(\ 


(2) 


In  Fig.  2  we  show  plots  of  //(/,  P)/I  as  a  function  of  /  for 
various  values  of  P.  In  fact,  it  can  be  shown  that 


/ 


indicating  that  the  portion  of  capacity  devoted  to  overhead 
becomes  zero  as  the  block  size  is  increased.  There  is  a  price  to 
be  paid  in  performance  since  bit  delay  increases  as  the  block 
length  increases.  Thus,  by  increasing  /,  the  block  length,  and 
by  holding  P  fixed,  the  source  can  be  matched  to  the  channel 
in  the  sense  of  H(l.  P)/l  <  1  (see  Fig.  2).  On  the  other  hand, 
we  can  show  that  for  /  fixed,  //(/,  P)/l  >  1  as P approaches  l . 
Define  e  £  1  -  P.  For  e  small.  //(/,  I  -  e)  a  (1  -  e)l  -  (1  - 
le )  log2(l  -  le)  -  /e  log2/e.  Setting  //(/,  1  -  e)  >  1  we  find 
that 


~i  1  -  It)  log2  ( 1  -  le)  >  le(l  +  log2  le).  (3) 

For  0  <  le  <  the  RHS  of  (3)  is  negative.  Since  the  LHS 
of  (3)  is  positive  for  0  <  le  <  1 ,  the  assertion  is  proved. 

The  foregoing  illustrates  the  crux  of  the  problem  of  encod¬ 
ing  of  bursty  sources;  the  amount  of  overhead  required  is  a 
function  of  the  block  length  which  translates  directly  into 
delay.  The  same  phenomena  can  be  seen  operating  for  more 
complex  source  models.  As  an  example  consider  the  case  of 
multiplexing  several  distinct  sources  on  the  same  channel.  If 
the  allowable  delay  is  large  enough,  the  overhead  required  to 
distinguish  the  source  can  be  made  negligible.  For  example, 
data  from  each  of  the  sources  could  be  collected  in  separate 
buffers.  The  contents  of  these  buffers  could  then  be  time 
division  multiplexed  onto  the  line.  In  this  way  no  overhead  is 
required  to  indicate  the  sources  of  data.  This  procedure 
breaks  down  when  one  or  more  of  the  source  buffers  is  empty. 
However,  by  making  delay  large  enough,  the  occurrence  of  this 
event  will  have  little  effect  2 

J  For  i  treatment  ot  encoding  of  multiple  sources  see  [ 6 1 


ADAPTIVE  ENCODING 

The  block  encoding  technique  discussed  above  can  be  im¬ 
proved  upon  by  adding  a  measure  of  adaptivity  to  the  process. 
Assume  that  the  encoding  begins  in  the  same  way  as  in  the  pre¬ 
vious  section  by  using  a  Huffman  code  to  transmit  the  number 
of  data  bits  that  the  source  produces  in  an  /(  second  interval. 
The  probabilities  of  the  number  of  data  bits  are  a  function  of 
/ 1  as  well  as  P  [see  (2)) .  Thus  the  Huffman  code  is  in  turn  a 
function  of  /, .  It  is  presumed  that  the  information  sink  knows 
/ !  and  P  and  therefore  knows  the  Huffman  code  that  is  to  be 
used.  Let  us  assume  that  the  transmission  of  the  code  word 
together  with  the  actual  data  bits  consumes  l2  seconds.  The 
source  output  over  the  /2  second  interval  is  encoded  in  the 
same  fashion  as  previously.  Since  in  general  lt  is  not  equal  to 
/2 ,  the  Huffman  code  used  to  encode  the  number  of  data  bits 
is  different.  However,  the  source  and  sink  can  cooperate  in 
the  construction  of  a  code  since  /2  is  known  to  both.  The 
process  repeats  for  each  successive  interval.  This  adaptive 
scheme  is  at  least  as  good  as  the  fixed  block  scheme  since 
encoding  begins  immediately  after  the  last  data  bit  of  a  block 
has  been  transmitted.  The  only  overhead  is  the  Huffman  code¬ 
word  for  the  number  of  bits.  In  the  fixed  block  scheme  mes¬ 
sages  can  be  delayed  while  zeros  are  filling  out  a  block. 

The  successive  coding  intervals  /| .  /; ,  —  in  this  scheme  form 
a  Markov  chain.  Since  there  are  a  large  number  of  states,  find¬ 
ing  the  state  probabilities  is  a  formidable  numerical  problem. 
For  our  purposes  a  Monte  Carlo  simulation  of  a  somewhat 
simplified  model  of  the  scheme  is  adequate  The  simplification 
comes  about  by  upper  and  lower  bounding  the  Huffman  code¬ 
word  by  quantities  that  arc  much  easier  to  compute.  The  un¬ 
certainty  of  a  symbol  with  probability  P,  is  -log2P,.  Let 
[— log2  P,| "  denote  the  integer  part  of  -log2  />,.  If  a  code  with 
lengths  [-log2  Pt\  ~  :i  =  0,  1,  — existed,  its  average  length 
would  lower  bound  the  entropy,  which  in  turn  lower  bounds 
the  average  length  of  a  Huffman  code.  A  code  with  lengths 
[  - log2  P, J  ~  +  I  satisfies  the  Kraft  inequality;  consequently , 
a  code  with  these  lengths  can  be  constructed.  However,  such 
a  code  must  have  average  length  greater  tlun  the  appropriate 
Huffman  code.  The  results  of  the  simulation  are  shown  in  Fig. 
3  where  the  averages  of  the  upper  and  lower  bounds  of  the 
block  lengths,  /, ,  /2 ,  ,  are  plotted  as  a  function  of  log,(  1/ 1 
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Fig.  3.  Performance  of  adaptive  block  encoding  scheme. 

P).  The  results  indicate  that  as  P  approaches  one,  the  channel 
is  matched  by  encoding  over  longer  blocks  which  implies 
larger  delay.  This  result  is  consonant  with  the  results  shown  in 
Fig.  2. 

The  results  of  Figs.  2  and  3  also  suggest  that  for  a  given 
value  of  P,  the  average  length  of  a  block  is  given  by  block 
/  =  H(l,  P).  It  can  be  shown  that  there  exists  an  /*  such  that 
E(l2!l , )  <  /,  if  /,  >  I*  and  F(/2//j)  >  /,  if  /,  <  /*.  In  Fig.  3 
the  locus  of  such  points,  i.e.,  //(/*,  P)  =  /*,  is  shown.  We 
notice  in  Fig.  3  that  as  P  increases,  this  equilibrium  point 
occurs  for  larger  and  larger  values  of  /,  indicating  larger  and 
larger  coding  delay.  In  fact,  it  can  be  shown  that  the  solution 
to  the  equation  is  approximated  by 

/•  s  2. 78/(1  - P)  (4) 

for  P  close  to  one.  Consider  the  entropy  of  a  binomial  distri¬ 
bution  with  parameters  P  and  l.  The  maximum  probability  is 
approximately  equal  to  (2n/Xl  —  P)i]~1^2  implying  that  the 
entropy  is  greater  than  or  equal  to  log2(2irF(l  -  F)/).  Now,  a 
theorem  attiibuted  to  J  M.  Massey  states  that  the  entropy  of 
a  discrete  random  variable  with  variance  a2  is  less  than  or 
equal  to  |  log2(2trc(oJ  +  1/12)).  Since  the  variance  of  the 
binomial  distribution  is  A1  -  F)/,  we  may  approximate  the 
entropy  by  ^  log2(2rre/>(l  -  P)i).  The  solution  to  the  equation 
\  log22rrcF(l  -  Py*  +  F/*  =  /*  is  given  by  (4)  for/* close  to 
one. 


FLAGS 

An  alternative  to  block  encoding  is  a  technique  employing 
flags  Whenever  the  tr  smitter  has  nothing  to  send,  this  state 
is  indicated  b'  if  a  unique  F-bit  sequence,  the  flag. 

During  the  tim  ite  requires  transmit  tliis  flag,  binary 
digits  produced  tlu  jo  e  are  buffered  and  read  out  after 
the  entire  flag  has  been  tiansmitted.  The  output  of  the  source 
is  transmitted  without  encoding  except  when  the  random 
source  replicates  the  first  /  -  I  bits  of  flag  In  this  event  the 
transmitter  stuffs  a  bit  which  is  the  complement  of  the  last 
bit  of  the  (lag.  Thus,  the  only  time  a  flag  is  transmitted  is 
when  the  buffer  is  enipt>  and  a  blank  appears  al  the  output 
ol  the  source. 


As  we  shall  see  presently,  for  each  value  of/*  there  is  a  flag 
length  which  minimizes  bit  delay.  Tins  minimum  strikes  a 
balance  between  flag  bits  and  stuffed  bits.  As  P  decreases  the 
buffer  is  more  likely  to  be  empty;  therefore,  the  flag  length 
should  be  decreased.  However,  decreasing  the  flag  length  in¬ 
creases  the  occurrence  of  stuffed  bits. 

The  frequency  of  occurrence  of  stuffed  bits  in  a  random 
data  sequence  can  be  analyzed  by  means  of  the  theory  of  re¬ 
current  events.  The  buffer  occupancy  can  be  analyzed  by 
means  of  a  Markov  chain  model.  Included  in  both  of  these 
analyses  are  the  parameters  P,  the  probability  the  source  pro¬ 
duces  a  binary  digit,  and  F,  the  length  of  the  flag.  It  can  be 
shown  that  the  results  also  depend  upon  the  particular  bit  pat¬ 
tern  in  the  flag. 

The  structure  of  flags  was  first  studied  by  Nielsen  [7] ,  who 
defined  the  concept  of  bifix  in  connection  with  flags.  A  flag 
is  bifix  free  if  there  is  no  sequence  which  is  a  prefix  and  a 
suffix  to  the  first  F  —  1  bits  of  the  flag.  The  sequences  10000 
and  00001  are  examples,  respectively,  of  bifix  free  and  non- 
bifix  free  flags.  It  is  a  simple  exercise  to  demonstrate  that  non- 
biflx  free  flags  can  cause  delays  in  the  decoding  of  data  bits. 
While  this  is  no  problem  in  terms  of  operation,  the  analysis 
is  complicated.  We  shall,  therefore,  concentrate  on  the  bifix 
free  flag  100  —  0  in  the  sequel. 

The  theory  of  success  runs  as  treated  by  Feller  [8]  allows 
one  to  find  the  generating  function  of  the  time  between 
occurrence  of  stuffed  bits  in  a  stream  of  random  zeros  and 
ones.  For  the  moment  we  consider  successive  source  outputs, 
ignoring  intervening  blanks.  Consider  a  flag  consisting  of  a  one 
followed  by  F  -  1  zeros.  If  the  source  produces  a  one  fol¬ 
lowed  by  F  -  2  consecutive  zeros,  immediately  a  one  is 
stuffed.  This  is  a  renewal  point  since  the  system  erases  all 
memory  and  begins  afresh.  Let  Uj  be  the  probability  of  a 
stuffed  bit  after  the  /th  data  bit.  The  probability  of  a  random 
data  sequence  producing  a  one  followed  by  F  -  2  zeros  is 
2~F * 1 .  However,  a  stuff  bit  can  only  occur  after  F  -  2  zeros. 
We  have 

l//=  (5) 

The  generating  function  of  the  recurrence  interval  is  then 


S(S/2f~ 1 

F(^~  1  -S  +  S(SI2'f-' 
The  mean  recurrence  interval  is 

U=2F~l. 


(6) 


(7) 


In  (7)  the  recurrence  interval  is  in  terms  of  successive  zero 
and  one  outputs  of  the  source.  However,  between  these  out¬ 
puts  are  a  geometrically  distributed  number  of  blanks.  If  these 
blanks  .ire  taken  into  account,  the  average  time  interval  in 
seconds  is  given  by  p//’. 

In  cv'nnection  with  the  frequency  of  occurrence  of  stuff 
bits,  the  question  of  stability  arises.  I  or  the  bursty  model 
the  mean  tunc  between  blanks  is  l/tl  /’I  In  order  that  the 
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transmit  buffer  not  overflow,  it  is  necessary  that  the  mean 
time  between  blanks  be  less  than  the  mean  time  between 
stuffs  For  (8)  we  have  the  criterion 


_  In  (/>/(  1  -/>)) 
r  / 

In  2 


+  1 . 


(8) 


The  overhead  incurred  by  the  use  of  flags  other  than  that 
treated  above  also  fellows  directly  from  an  analysis  of  success 
runs  Furthermore,  the  same  analysis  of  the  frequency  bit 
stuffing  can  be  carried  out  for  the  simultaneous  use  of  several 
different  flags. 

Camrass  and  Gallager  [9]  have  studied  source  encoding 
by  means  of  flags.  There  is  a  similarity  to  our  work  in  that  we 
both  use  the  theory  of  success  runs  to  study  the  frequency  of 
occurrence  of  stuff  bits.  However,  the  difference  lies  in  the  use 
to  which  the  flags  are  put.  Camrass  and  Gallager  append  a 
flag  to  a  message  which  happens  to  have  a  geometric  distri¬ 
bution.  In  our  model  the  burst  of  consecutive  binary  digits 
is  geometrically  distributed,  but  we  do  not  delineate  differ¬ 
ent  bursts.  A  flag  is  transmitted  only  if  there  are  no  source 
output  bits  available. 

BIT  ENCODING  DELAY 

The  encoding  delay  of  a  bit  in  the  flag  scheme  can  be 
studied  by  means  of  a  two-dimensional  Markov  chain  model. 
The  chain  is  in  state  (/,/)  if  there  are  i  bits  in  the  buffer  and  if 
/  of  the  immediate  preceding  successive  bits  in  the  data  se¬ 
quence  coincide  with  the  First  /  bits  of  the  flag.  The  last  com¬ 
ponent  is  the  memory  of  the  system  and  has  dimension  F-  2. 
The  mechanism  for  state  transmission  may  be  described  by 
considering  specific  cases.  If  the  buffer  is  not  empty,  its  con¬ 
tent  decreases  when  the  data  source  produces  a  blank.  If  the 
buffer  is  empty  when  a  blank  is  produced,  then  the  first  bit 
of  the  flag  is  transmitted  and  the  remaining  F  -  1  bits  are 
inserted  into  the  transmit  buffer.  The  level  of  the  buffer  can 
also  increase  due  to  a  stuff  bit.  We  point  out  that  the  stuffing 
of  a  bit  or  the  transmission  of  a  flag  brings  the  system  to  the 
zero  memory  state. 

Let  us  consider  the  flag  100  0,  a  one  followed  by  F  -  l 

zeros.  The  state  transition  diagram  for  this  case  is  shown  on 
Fig.  4.  Let  Qij-,  i  =  0,1,2,  — ;  /  =  0,  1 ,  — ,  F  -  2  denote  the 
steady  state  probability  that  the  system  is  in  state  (/,/).  We 
can  write  a  set  of  equilibrium  equations  by  considering  par¬ 
ticular  sets  of  states.  Consider  the  states  such  that  /  >  0, 
2  Kj  <  F  —  2;  we  have 


SIAtf  OF  MIHORt 

j  . /  .  r  >  •  ! 


Fig.  4.  Markov  chain  model  of  flag  scheme. 


turn  now  to  the  case  i  s*  0,  F  -  1 ,/  **  0.  We  have 

Qi  o  =  (1  ~F)Qi*  i.o  +  (F/2)ql0  +  (PI2)qt_xF_i  (1 1) 
For  i  >  0  and  /  =  1  we  have 

<7/t  =(PI2)q,  +  (l  (12) 

where  qt  =  li,0p~1qii;  /  >  0  is  the  probability  that  the 

bufTer  size  is  i.  The  final  cate  to  be  considered  is  i  =  F  -  1 , 

/*  0. 

QP- 1  ,o  *  (1  ~P)Qf, o  +  (PI2)4f- i  .o 

+  (/*/2)^_i^_j+(1-/0<7o  03) 

Define  the  z-transform  of  the  buffer  occupancy  probabil¬ 
ities  at  a  particular  memory  state  as 

— 

G/O)  =  2  hij-  i  *  o,  1 ,  F  -  2  (14a) 

i*  o 


and  the  buffer  occupancy  moment  generating  function  as 


F-  1 


2  G/O) 


/“0 


(14b) 


Employing  (9)-(  1 3)  together  with  the  definitions  in  (14)  it 
can  be  shown,  after  some  manipulation,  that 


(*-0 


(z-1  +/’)'"* 


qtl  =  (P/2)qlj_l  +(1  -Fk7/+,./.  (9) 

The  state  (i,  /)  is  entered  by  the  arrival  of  a  bit  replicating  a 
flag  bit  with  probability  P/2  or  the  arrival  of  a  blank  with 
probability  l  -  P.  For  i  =  0  and  /  =  0  we  have 

<7oo  =  (•  -^io  +  (F/2>7o0.  (10) 

Notice  that  the  zero  memory  state  unplies  that  a  flag  or  a 
stuff  bit  has  been  transmitted.  The  system  remains  in  the  zero 
memory  state  if  a  zero  is  the  output  of  the  data  source.  We 


•(z-1  +Pf 


2(S  -1)1(1  - 

. 


P)Qoi 


+  (l-P)(z--\+Pf-2(zF~\)qQ0  /(z-1) 

'  1(1  ~P)(z  —  1  +  Pf~2  -  (Pz/2)f~  1 1  (15) 


In  (15)  there  are  F  -  I  unknowns  for  q0o  f  =  0.  I.  . 
F  -  2.  In  order  to  solve  for  these  unknowns  we  utilize  a  tech¬ 
nique  employing  Rouche's  theorem  [10| .  [1 1 1 .  ()(:>  is  ana¬ 
lytic  within  the  unit  disk;  consequently ,  when  the  denomma- 
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tor  has  a  zero  within  the  uni!  disk,  the  numerator  must  have  a 
zero  at  the  same  point.  From  the  stability  condition  of  (8)  it 
can  be  shown  that  the  polynomial  in  the  denominator  of  (15) 
has  F  -  2  di't'n.t  roots  on  the  unit  disk  Let  us  denote  these 
roots  as  0j'.  :  =  1 ,  2 .  ■■■,  F  -  2 .  Since  the  denominator  of 
(J(:)  must  be  equal  to  zero  at  these  points,  we  have  then  from 
( 1 5 )  the  following  set  of  equations. 


(d,  -  1  +  F)l~  1 


-(fl,-l  +r)F~2ViF-  1) 


(1  ~P)Qo, 


+  (1  -«-l  +/>f-W-l)<7oo 

=  0,  i'=  1,2,  -,F-2.  (16a) 

We  also  have  the  normalizing  condition 

2(D  =1.  (16b) 

Equations  (16a)  and  (16b)  give  F  -  1  equations  in  F  -  1 
unknowns. 

Finding  the  roots  of  the  polynomial  P(z)  and  solving  (16) 
for  the  quantities  qol,  l  =  0,  1 ,  — ,  F  -  2  are  relatively  straight¬ 
forward  numerical  problems.  We  can  then  calculate  (/(r), 
the  generating  function  of  the  transmit  buffer  occupancy. 
Returning  to  the  equilibrium  equations  (9  )-<  13).  we  see  that 
the  probabilities  qtj,  i  =  0, 1 , 2,  ■;/  =  0, 1 ,  -.F  -  2  can  also 
be  found.  These  will  be  used  to  calculate  decoding  delay  in  the 
sequel. 

We  use  as  a  measure  of  performance  the  average  delay 
suffered  by  a  bit  as  determined  by  the  average  number  of  bits 
in  the  buffer.  The  average  number  of  bits  in  the  buffer  can  be 
determined  from  Q(z).  The  results  are  shown  in  Fig.  5  where 
q.  the  average  delay,  is  shown  as  a  function  of  flag  length  with 
1  -  P,  the  probability  of  a  blank,  as  a  parameter. 

As  shown  in  Fig.  5,  for  large  values  of  F,  the  average  delay 
may  be  approximated  by  (F  -  l)/2  for  all  values  of  P.  For 
large  values  of  F,  the  buffer  occupancy  jumps  to  F  -  1  when  a 
flag  is  sent.  It  more  or  less  linearly  (on  the  average)  returns 
to  zero  as  blanks  predominate  over  stuffed  bits.  When  it 
reaches  zero  a  flag  is  sent  again.  As  the  flag  length  is  decreased, 
the  stuff  bits  come  into  play.  (Recall  that  from  (7)  the  mini¬ 
mum  value  of  F  is  determined  by  considering  stability  in 
connection  with  stuffed  bits.)  Although  it  is  not  evident 
from  the  cur.cs  shown  in  Fig.  5  for  large  P,  there  is  a  mini¬ 
mum  value  of  average  delay  at  a  value  of  F  larger  than  the 
minimum,  litis  slight  dip  in  the  curves  is  shown  by  the  figures 
in  Table  J. 

As  P  approaches  one,  the  behavior  of  the  flag  encoding 
technique  is  similar  to  the  block  encoding  technique  de¬ 
scribed  in  tne  above.  As  is  illustrated  on  Fig  5,  the  delay  for 
any  value  of  P  may  be  lower  bounded  by  (F  1  )/2.  However, 
from  (10)  we  have  that  F  >  log;(F/l  -  P)  +  1,  and  as  P 
approaches  one,  average  deljv  becomes  infinite 


Fig.  5 .  Performance  of  flag  icheine. 


TABLE  I 


1  -  A" 

7 

8 

9 

10 

11 

12 

2-5 

3.82 

3.78 

4.12 

5.03 

5.51 

2-6 

4.40 

4.30 

4.63 

5.53 

2"7 

4.94 

4.82 

5.13 

5.56 

2-» 

5.47 

5.32 

5.64 

COMPARISON  OF  BLOCK  AND  FLAC  ENCODING 

A  comparison  of  Figs.  3  and  5  shows  a  large  difference  in 
encoding  delay  in  favor  of  the  flag  scheme.  For  example,  for 
P  =  1  -  2“ 4  the  average  block  length  is  approximately  40 
bits.  We  take  this  to  be  an  approximation  to  the  number  of 
bit  intervals  a  newly  arriving  bit  must  wait,  i.e.,  40  s.  In  con¬ 
trast,  the  encoding  delay  for  a  bit  in  the  flag  technique  can  be 
less  than  4  s  for  the  appropriate  flag  length.  These  results 
apply  generally  to  other  probabilities.  Further,  as  P  -*  1, 
delay  in  the  block  scheme  is  proportional  to  1/(1  -  P),  whereas 
in  the  flag  scheme  it  is  proportional  to  log  1/(1  -  P). 

For  the  flag  encoding  scheme,  encoding  delay  is  not  the 
only  delay  encountered  before  a  bit  is  delivered  to  an  informa¬ 
tion  sink  at  the  receiver.  When  a  bit  is  received  there  may  be 
an  ambiguity  that  must  be  resolved.  Moreover,  this  decoding 
delay  is  random  and  depends  upon  the  state  of  the  system, 
begin  the  analysis  by  assuming  the  state  (/,  /)  for  the  flag 
100  —  0.  Recall  that  /  is  the  number  of  data  bits  in  the  trans¬ 
mitting  buffer  and  /  is  the  system  memory  with  respect  to  the 
flag.  The  ambiguity  is  resolved  immediately  if  the  received  bit 
does  not  replicate  the  flag,  i.e.,  a  zero  for  state ;  —  0  and  a  one 
otherwise.  The  ambiguity  persists  if  the  bits  generated  by  the 
source  continue  to  replicate  the  flag,  and  if  the  source  pro¬ 
duces  i  or  fewer  blanks.  To  justify  this  second  requirement  we 
note  that  more  than  i  blanks  would  deplete  the  buffer  and  a 
flag  that  would  be  transmitted  resolves  the  ambiguity.  The 
ambiguity  can  persist  for  at  most  F  -  j  -  1  seconds,  since  the 
Fust  F  -  1  bits  of  the  flag  would  be  replicated  and  a  stuffed 
bit  would  be  transmitted,  also  resolving  the  ambiguity.  We 
summarize  this  in  the  following  equations  where  the  random 
variable  D  denotes  decoding  delay. 

P r\D  >d/(i,f)) 

,  ‘  ,(d  +  l~  l\ 

=  (i;:)(/’/2)‘J  -/*)' '(  J  (17) 

i~-  0  \  i  / 

Lqu.ition  {17)  is  the  probability  of  a  sequence  replicating  the 
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flag  bits  interspersed  with  i  or  fewer  blanks.  Uased  on  (17)  we 
have  for  the  probability  of  delay 

P,[D  =  0/i.y]  =1/:  for  all  i.j 


Pr[D=  l/f./| 


Pr[D  =  d/iJ) 


1/2;  j  =  F  —  2 
1/2  —Pr[D  >  l/(*. /); 

/  =  0,  1.-.F-3 


0; 

P  r[D>d  —  1 /(/,/)] 
Pr[D>d  —  l/(«',/)l 
-p,[D>d/(f,m 


d>F-j 
d  =  F  —j  —  \ 

0<d<F~j -2. 


The  average  delay  can  be  found  by  averaging  over  this  condi¬ 
tional  distribution,  then  averaging  over  the  state  (/,/)  whose 
distribution  was  obtained  in  the  previous  analysis.  The  results 
axe  shown  in  Fig.  6  where  delay  is  plotted  as  a  function  of 
flag  length  with  1  -  /’as  a  parameter.  We  see  that  the  decoding 
delay  is  not  large  and  that  it  is  not  very  sensitive  to  F  and 
1  —  P.  We  conclude  then  that  decoding  delay  has  little  effect 
upon  the  flag  encoding  scheme  as  related  to  the  other  encod¬ 
ing  techniques  we  have  examined. 

CONCLUSION  AND  FUTURE  WORK 

A  simple  source  encoding  problem  which  illustrates  the 
tradeoff  between  delay  and  efficient  encoding  has  been  posed. 
Two  encoding  techniques,  one  using  Huffman  codes  and  one 
using  flags,  have  been  examined  in  detail.  For  the  second  of 
these,  stability  conditions  were  found  and  a  Markov  chain 
model  was  analyzed.  In  both  cases  it  was  found  that  as?-*  1 
the  delay  approaches  infinity.  The  paradox  in  this  result  is 
that  for  P  =  1  the  coding  problem  disappears.  A  secondary 
result  is  the  superiority  of  the  flag  encoding  scheme  over 
Huffman  coding  when  delay  is  used  to  measure  performance. 
This  suggests  a  kind  of  compatibility  between  delay  and  flags. 

We  have  also  found  other  bounds.  All  lower  bounds  found 
tend  to  a  finite  value  as  P  -*  1 ,  while  for  all  upper  bounds 
found  the  delay  becomes  unbounded.  Recently  Roskind  [12 j 
has  found  a  strategy  for  which  the  delay  is  proportional  to 
log  [log  1/(1  -  /»))  as  P  -*  1 . 

The  work  can  be  continued  in  several  directions.  The  most 
compelling  of  these  is  the  study  of  a  lower  bound  for  delay 
over  all  possible  codings  especially  when  P  is  close  to  one. 
Also,  the  source  model  can  be  extended  by  considering  more 
general  arrival  processes,  and  by  considering  several  sources 
which  must  remain  distinct,  multiplexing  can  be  addressed. 
Again  it  appears  that  a  Markov  chain  model  can  be  used  to 
describe  a  flag  encoding  scheme. 
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ABSTRACT 

A  model  of  a  bursty  source  whose  output  Is 
transmitted  over  a  synchronous  channel  is  studied 
with  the  objective  of  understanding  the  relation¬ 
ship  between  coding  efficiency  and  delay.  Two  en¬ 
coding  schemes,  a  block  encoding  technique  and  a 
technique  employing  flags,  are  examined  In  some 
detail.  The  flag  technique  Is  analyzed  by  means  of 
the  theory  of  recurrent  events  and  by  two  dimen¬ 
sional  Markov  chains.  For  both  the  block  encoding 
and  the  flag  schemes  It  is  shown  that  as  the  source 
becomes  less  bursty  delay  approaches  Infinity. 

This  results  obtains  in  spite  of  the  fact  that  both 
schemes  are  reasonable  and  in  the  limit  the  encoding 
problem  disappears. 


INTRODUCTION 

This  paper  is  concerned  with  the  interplay  of 
delay  and  overhead  in  the  multiplexing  of  data  on 
common  facilities.  The  relationship  of  these  quan¬ 
tities  Is  due  to  certain  unique  properties  of  data 
in  an  interactive  environment.  A  basic  property 
of  data  traffic  Is  burstlness,  l.e.,  short  busy 
periods  Interspersed  with  relatively  long  Idle 
periods  which  contain  no  Information.  The  second 
relevant  property  of  data  has  to  do  with  delay.  In 
many  applications  data  crafflc  Is  highly  inter¬ 
active  placing  a  premium  on  rapid  response  times. 
These  properties  of  data  signals  allow  one  to  for¬ 
mulate  a  source  encoding  problem  which  considers 
delay  as  a  basic  component.* 

SOURCE  MODEL 

In  order  to  study  the  problem  of  delay  in  cod¬ 
ing  we  consider  the  following  model.  A  source  pro¬ 
duces  either  a  blank  or  blnarv  digit  once  every 
second  with  the  following  probabilities. 


*  Related  work  on  the  encoding  uf  bursty  sources 

i  : 

has  been  carried  out  by  Callager  and  Humbler  . 
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P [blank)  *  1  -  P 
P [0  output )  -  P/2 

P  i  1  output)  -  P/2 


(1) 


The  sequence  of  0's  and  l's  produced  by  this  source 
is  to  be  carried  over  a  synchronous ,  error-free  bi¬ 
nary  channel  with  a  capacity  of  one  bit  per  second. 
This  source  produces  Information  at  an  average  :..te 
of  P  bits/second.  Thus  for  P  <  1  there  will  be 
times  when  the  source  has  nothing  to  transmit.  But 
the  channel,  being  synchronous,  transmits  only  a 
zero  or  a  one  every  second  and  this  condition  must 
be  suitably  encoded.  In  general  the  number  of  over¬ 
head  bits  that  are  required  to  encode  the  state  of 
the  source  Is  a  function  of  the  allowable  delav. 

The  relationship  between  delay  and  channel 
capacity  can  be  Illustrated  by  means  of  the  follow¬ 
ing  encoding  procedure.  All  of  the  source  outputs 
in  an  l  second  Interval  are  encoded  in  a  block.  I- 
each  l  bit  Interval  the  information  that  must  no 
transmitted  is  the  number  of  binary  digits  gener¬ 
ated  by  the  source  and  the  values  of  these  digits. 
If  this  requires  less  than  l  bits  intervals,  zeros 
can  be  transmitted  In  the  slack  interval  with  no 
ambiguity.  Since  the  number  of  data  bits  produced 
in  an  1  bit  Interval  follows  a  binomial  distribu¬ 
tion,  the  entropy  of  this  encoding  is 


H(t,P)  -  PI  -  [  b(l;t,P)log  b(i;t,P) 


where 


b( 


i-° 

i;t,P)  ~V)fi  (l-P) 


t-i 


(2) 


In  Figure  I  we  show  plots  of  H(lvP)/l  as  a  function 
of  t  for  various  values  of  P.  We  see  that  H(l.P)/t 
asymtot ically  approaches  P  for  large  £.  Thus,  th* 
portion  of  capacity  devoted  to  overhead  be’omv 
zero  as  the  block  size  is  increased.  Howe. or  u  i 
delay  increases  as  the  block  length  invreises. 
increasing  l.  the  block  length,  and  by  boldine  P 
fixed  the  source  can  he  matched  to  the  channel  m 
the  sense  of  H(£,P//£  1  (see  Fig.  1).  On  the 

other  hand  it  can  be  shown  that  for  '  :  Ixt  i 

H(t,P)/l  >  1  as  P  approach* s  l.  Define  e  \  1 

For  f  small  h(?,l-c)  =  -  (1-tr.  >  log,i  l-tc  * 

-  lelog^te.  But  for  H(l,l-e)  '  l  ue  have 

-(l-leUcg^U-te)  >  teU  +  loj^te  ) 

For  the  RHS  of  Fq  .  3  is  negative.  S.r-  e  • 

LHS  of  E  .  3  positive  for  0*  i*  *  1  ,  tie  i  .  •  » 

Is  proved  . 
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The  foregoing  Illustrates  the  crux  of  the  pro¬ 
blem  of  encoding  of  bursty  sources;  the  amount  of 
overhead  required  is  a  function  of  the  block  length 
whiv.h  translates  directly  into  delay.  The  same 
phenomena  can  be  seen  operating  for  more  complex 

2 

source  models  involving  multiple  sources  . 


From  Rq .  4  one  can  find  the  generating  function  of 
the  recurrence  times. 


(z/2)f_1U-z/2) 


l-z+(z/2) 


We  can  calculate  the  mean  recurrence  time  as 


Flags  can  be  used  to  encode  the  basic  bursty 
so  ret-  discussed  above.  The  encoding  process  is 
initiated  by  transmitting  the  digital  output  of  the 
source.  When  a  blank  appears  a  flag  composed  of  a 
unique  F-bit  sequence  is  transmitted.  During  the 
time  interval  required  to  transmit  this  flag  binary 
digits  produced  by  the  source  are  buffered  and  read 
out  after  the  entire  flag  has  been  transmitted.  The 
output  of  the  source  is  transmitted  without  encod¬ 
ing  except  when  the  random  source  replicates  the 
first  F  -  1  bits  of  flag.  In  this  event  the  trans¬ 
mitter  stuffs  a  bit  which  is  the  complement  of  the 
last  bit  of  the  flag.  The  only  time  a  flag  is 
transmitted  is  when  the  buffer  is  empty  and  a  blank 
appears  at  the  output  of  the  source.  For  each  value 
of  P  there  is  a  flag  length  which  strikes  a  balance 
between  flag  bits  and  stuffed  bits.  As  P  decreases 
the  buffer  is  more  likely  to  be  empty  therefore  the 
flag  length  should  be  decreased.  However  decreasing 
the  flag  length  Increases  the  occurrence  of  stuffed 
bits. 

The  frequency  of  occurrence  of  stuffed  bits  in 
a  random  data  sequence  can  be  analysed  by  means  of 
the  theory  of  recurrent  events.  The  buffer  occu¬ 
pancy  can  be  analyzed  by  means  of  a  Markov  chain 
model.  Included  in  both  of  these  analyses  are  the 
parameters  P.  the  probability  the  source  produces  a 
binarv  digit  and  F,  the  length  of  the  flag.  To  some 
extent,  the  results  also  depend  upon  the  particular 
bit  pattern  in  the  flag.  ^ 

The  theory  of  success  runs  as  treated  by  Feller J 
allows  one  to  find  the  generating  function  of  the 
time  between  occurrence  of  stuffed  bits  in  a  stream 
of  random  zeros  and  ones  (For  an  alternative  ap¬ 
proach  see  Ref.  U) .  Consider  a  flag  consisting  of 
•  zeros.  If  the  source  produces  F  -  1  consecutive 
zeros  irar.ed iately  a  one  Is  stuffed.  This  is  a  re¬ 
newal  point  since  the  system  erases  all  memory  and 
begins  afresh  the  counting  or  strings  of  zeros.  Let 
t  be  the  probability  of  a  stuffed  bit  after  the  Jth 

data  bit.  The  probability  of  t lie  source  producing  a 
sequence  of  F  -  1  consecutive  zeros  on  bit  numbers 

J .  J  -  1,...,J  -  F  +  2  is  2  F+^ .  There  is  one  and 
only  one  stuff  bit  after  one  of  these  zeros.  The 
probability  that  the  stuff  is  after  bit  number  J-k 

and  the  next  k  bits  are  zeros  is  0,  ,2  (k-0,1 . 

J-k 

F  -  2>.  Since  F  -  1  possibilities  are  mutually  ex¬ 
clusive  we  have  the  following  recurrence  relation¬ 
ship 

V,  -  ♦  ...  *  U,_f+,2'F+2  -  2'F+1;  j  >  F  -  1 


U0  ’  1 


1  -  (1/2)' 


2F-  2 


The  sensitivity  of  the  stuffing  process  to  the  part¬ 
icular  flag  pattern  is  illustrated  by  considering 
the  flag  100... 0,  a  one  followed  by  F  -  1  zeros. 

The  probability  of  a  random  data  sequence  producing 

F-l 

a  one  followed  by  F  -  2  zeros  Is  (1/2)  ;  however 

a  stuff  bit  can  only  occur  after  F  -  2  zeros.  Ue 
have 

U  -  2~F+1 

The  generating  function  of  the  recurrence  times  is 
then 

'z/2)F-^ 

F,(z) - iillL— -  (7) 

2  1  -  z  +  (z/2) 

The  mean  recurrence  time  la 

U2  -  2F_1  (8) 

It  is  of  Interest  to  note  that  from  Eqs.  6  and  8 


lim  ^1  _  2 


Thus  In  the  limit  the  sequence  1000. . .0  evokes  twice 
as  many  stuff  bits  as  the  sequence  000... 0. 

In  connection  with  the  f requency  of  occurrence 
of  stuff  bits  the  question  of  stability  arises.  For 
the  bursty  model  the  mean  time  between  blanks  is 
1/(1-P).  In  order  that  the  transmit  buffer  not 
overflow  it  is  necessary  that  the  mean  time  between 
blanks  be  less  than  the  mean  time  between  stuffed 
bits.  From  Eq .  6  we  can  derive  the  following 
stability  criterion  for  the  all  zero  sequence. 

r  „  ln(2f(l/l-P)) 


From  Eq .  8  we  have  the  criterion  for  the  sequence 

100. . .0 

f  *  i  do 

In  2 

31 1  delav  for  the  f  1  a scheme  ran  be  studied 
by  meins  of  a  Markov  (ha  in  model.  For  ease  of 
analysis  we  only  consider  the  flag  100... 0.  The 
same  general  conclusions  apply  to  the  all  zero 
flap.  The  state  of  the  chain,  (1,1),  is  two  dimen¬ 
sional  consisting  of  the  number  of  bits  in  the  buf¬ 
fer,  i,  and  of  the  number  jf  bits  In  the  data 
sequence  that  replicate  the  flag,  J.  The  last 
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# 


St 


«.# 


component  Is  the  memory  of  the  system  and  has  di¬ 
mension  F  -  1.  The  mechanism  for  state  transmls 
sion  mav  be  described  by  considering  spec  1 f lc  cases. 
If  the  buffer  Is  not  empty  its  contents  decrease 
when  the  data  source  produces  a  blank..  If  the  buf¬ 
fer  is  empty  when  a  blank  Is  produced  then  the 
first  bit  of  the  flag  Is  transmitted  and  the  re¬ 
maining  F  -  1  bits  are  Inserted  Into  the  transmit 
buffer.  The  level  of  the  buffer  can  also  Increase 
due  to  a  stuff  bit.  Ue  point  out  that  the  stuffing 
of  a  bit  or  the  transmission  of  a  flag  brings  the 
system  to  the  zero  memory  state. 

The  state  transition  diagram  for  the  flag 
100. ..0  Is  shown  in  Fig.  2.  Let  q  ;  1  -  0,1,2...; 

J  •  0,1,...,  F  -  2  denote  the  steady  state  probabi¬ 
lity  that  the  system  is  in  state  (i,J).  Ue  can 
write  a  set  of  equilibrium  equations  by  considering 
particular  sets  of  states.  Consider  the  states  such 
that  1  >  0,  2  <_  J  J  F  -  2  we  have 

qU  ’  (p/2)n.j-i  +  (1-p)<’i+ij  <U) 

The  state  (i,J)  is  entered  by  the  arrival  of  a  bit 
replicating  a  flag  bit  with  probability  P/2  or  the 
arrival  of  a  blank  with  probability  1  -  P.  Now 
consider  the  case  where  a  string  of  zeros  Is  inter¬ 
rupted  by  a  one.  Ue  have 

qoo  ■  (1  -  p)qio  +  (p/2)<00  <12) 

Notice  that  the  zero  memory  state  implies  that  a 
flag  or  a  stuff  bit  has  been  transmitted.  The  sys¬ 
tem  remains  in  the  zero  memory  state  If  a  zero  Is 
the  output  of  the  data  source.  Ue  turn  now  to  the 
case  1  i  0,  F  -  1,  J  •  0.  Ue  have 

"i0  •  +  (P/2)qi0  +  <P/2>Vl.P-2 

(13) 

For  i  >_  0  and  J  >  1  we  have 


Employing  Eqs.  11—14  together  with  the  definitions 
in  Eq.  IS  it  can  be  shown,  after  some  manipulation, 
that 


-  (Z-1+P)F_2  (ZP  -  1))  (l-P)qot 

+  (1-P)  (z-1tP)F'2(zF  -  1 )q 
Q(x). - JliL  -I _ 

(z-1)  ((1-P)  (z-l+P)F~2  -  (Pz/2)F'1l 


In  Eq .  16  there  are  F  -  1  unknowns  for  q ; 

i  ■  0,1,...,  F  -  2.  In  order  to  do  this  we  utilize 
a  technique  employing  Rouche’s  theorem.  Q(z)  la 
analytic  within  the  unit  disk;  consequently,  when 
the  denominator  has  a  zero  within  the  unit  disk  the 
numerator  must  have  a  zero  at  the  same  point.  From 
the  stability  condition  of  Eq .  10  it  can  be  shown 
that  the  polynomial  in  the  denominator  of  Eq.  17 
has  F  -  2  distinct  roots  on  the  unit  disk.  Let  us 
denote  these  roots  as  8^;  1  •  1,2 .  F  -  2. 

Since  the  denominator  of  Q(z)  must  be  equal  to  zero 
at  these  points,  we  have  then  from  Eq.  17  the  fol¬ 
lowing  aet  of  equations. 

F-2  PP.  „  ,  ,  .  , 

-  [  ((Pj-D  (-,-)F  1  (0.-1+V 

l-l  1  1 

-  (81-1+P)F"2-(8F  -  1)1  (1-P)q0l 

♦  (l-P)  (eri+P)F_2  (eF  -  i)q00  -  o, 

1  -  1,2,...,f  -  2  (17a) 

Ue  also  have  the  normalizing  condition 


qil  '  (P/2)  qi  +  (1'P)  qi+l,l  (U) 


Q(l>  -  1 


(17b) 


The  final  cases  to 
i  -  F  -  1,  J  -  0. 

qF-l,0  ’ 


be  considered  are  that  of 

d-p)F,0  -  (P/2>qF_li0 


+  (P/2)qF  2  r  2  +  <1-P)q0  U5) 


where 


F-2 


'» '  Vj 


Define  the  z-transform  of  the  buffer  occupancy 
probabilities  as 


v°  4  l  *iqi  i  :  J  ■  0>1 . F  •  2 

J  1-0  ,J 


(16a) 


and 


F-2 

Q(z)  -  [  Q, (z) 

J-0  J 


(16b) 


Eqs.  17a  and  17b  give  F  -  1  equations  in  F  -  1  un¬ 
knowns. 

Finding  the  roots  of  the  polynomial  P(z)  and 
solving  Eq .  17  for  the  quantities  q,j£,  t  •  0,1,..., 
F-2  are  relatively  straightforward  numerical  pro¬ 
blems.  We  can  then  calculate  Q(z)  the  generating 
function  of  the  transmit  buffer  occupancy.  Return¬ 
ing  to  the  equilibrium  Eqs.  11-15,  we  see  that  the 
probabilities  q1  ^  1  -  0,1,2,...;  J  -  0,1 . 

F-2  can  also  be  found. 

We  use  as  a  measure  of  performance  the  aver¬ 
age  delay  suffered  by  a  bit  as  determined  by  the 
average  number  of  bits  in  the  buffer.  The  average 
number  of  bits  In  the  buffer  can  be  determined 
from  Eq .  16  by  differentiation  or  by  calculating 


-  F-2 

V  1  /  q  .  The  resu3ts  are  shown  in  Fig.  3 

1-0  1-0  1J 

where  q  average  delay  Is  shown  as  a  function  of 
flag  length  with  1-P,  the  probability  of  a 
blank,  as  a  parameter. 


M 


4 


..-•j 
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As  shown  in  F1r.  ),  for  large  value*  of  F, 
the  average  delay  may  be  approximated  by  (F-l)/2 
for  all  values  of  P.  This  Is  as  expected  since 
for  large  values  of  F,  stuff  bits  have  little  ef¬ 
fect.  As  the  flag  length  is  decreased  the  stuff 
bits  come  Into  play.  From  Eq .  10  the  minimum  val¬ 
ue  of  F  is  determined  by  considering  stability  In 
connection  with  stuffed  bits.  Although  it  i*  not 
evident  from  the  curves  shown  on  Fig.  3,  for  large 
P  there  Is  a  minimum  value  of  average  delay  at  • 
value  of  F  larger  than  the  minimum.  As P approach¬ 
es  one  the  behavior  of  the  flag  encoding  technique 
is  similar  to  the  block  encoding  technique  describ¬ 
ed  in  the  foregoing.  As  is  illustrated  in  Fig.  3 
the  delay  for  any  value  of  P  may  be  lower  bounded 
by  (F-1J/2  by  ignoring  the  effect  of  stuff  bits. 
However  from  Eq.  10  we  have  that  F  =  ®  ,  Thus 

as  P  approaches  one,  average  delay  becomes  Infinite. 
Eq.  9  indicates  that  we  may  expect  the  same  sort  of 
behavior  from  the  all  zero  sequence. 

CONCLUSION  AND  FUTURE  WORK 


A  simple  source  encoding  problem  which  illus¬ 
trates  the  tradeoff  between  delay  and  efficient 
encoding  has  been  posed.  Two  encoding  techniques 
have  been  examined  in  detail.  For  the  second  of 
these,  utilizing  a  flag,  stability  conditions  were 
found  and  a  Markov  chain  model  was  analyzed.  In 
both  cases  it  was  known  that  as  P-*l  the  delay  ap¬ 
proaches  Infinity.  The  paradox  in  this  result  la 
that  for  P  *  1  the  coding  problem  disappears. 

The  work  can  be  continued  in  several  direct¬ 
ions.  First  of  all  the  minimum  delay  over  all  cod¬ 
ing  schemes  for  P  close  to  one  is  of  Interest. 
Also,  the  source  model  can  be  extended  by  consider¬ 
ing  more  general  arrival  processes,  (e.g.  Poisson) 
and  by  considering  several  sources  which  must  re¬ 
main  distinct.  Again  it  appears  that  a  Markov 
chain  model  can  be  used  to  describe  a  flag  encoding 
scheme.  Finally  the  subject  of  decoding  delay  for 
Che  flag  scheme  merits  attention.  When  a  bit  forms 
part  of  a  flag  sequence  there  is  an  ambiguity  at 
the  receiver.  The  time  required  to  resolve  this 
ambiguity  is  a  function  of  the  state  of  the  system. 
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Abstract 


We  consider  the  problem  of  obtaining  an  optimal  strategy  for 
packetizing  a  data  stream  which  enters  a  network  one  character  at  a 
time.  Our  objective  is  to  maximize  the  number  of  users  subject  to  a 
delay  constraint  which  includes  both  queuing  and  packetization  delay. 
We  show  that  the  optimal  strategy  is  to  packetize  with  probability  (1-q) 
after  the  arrival  of  N  characters  and  to  otherwise  packetize  at  the 
arrival  of  the  N+l'st  character  where  N  and  q  are  functions  of  the 
delay  constraint.  A  proof  of  optimality  and  numerical  comparisons  with 
other  packetization  strategies  are  given. 
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Introduction 


I . 

Most  packet  switched  networks  are  characterized  by  the  "bursty" 
nature  of  the  traffic  that  they  carry,  i.e.,  by  the  fact  that  the  traffic 
sources  deliver  data  to  the  network  intermittently  with  a  high  peak-to- 
average  rate.  It  is  this  phenomenon  that  makes  packet  switching, 
which  gives  the  communication  channel  to  a  user  only  when  he  actually 
wants  to  use  it,  so  attractive.  Ironically,  however,  in  the  most 
extreme  case  of  burstiness,  when  the  sources  generate  data  one 
character  at  a  time,  packet  switching  may  become  unacceptable  because 
of  the  overhead  due  to  the  header  which  must  be  placed  on  each 
packet.  Thus,  if  an  H  character  header  is  used  with  only  one  data 
character  per  packet,  an  overhead  of  100H%  is  incurred.  Typical 
values  of  H  range  between  6  and  32  in  networks  in  operation  today.  If 
a  random  access  mechanism,  where  users  transmit  at  random  times  and 
contend  for  the  channel  is  used,  a  significant  number  of  additional 
synchronization  bits  must  be  added  to  the  header  and  H  may  become  as 
large  as  60  in  some  cases.  As  random  access  becomes  more  prevalent, 
as  it  is  in  local  area  networks  today,  this  problem  becomes  more  severe. 

An  obvious  solution  to  this  problem  is  to  buffer  input  characters 
until  a  reasonable  number  of  them  can  be  placed  in  a  packet  together. 
If  N  data  characters  are  placed  in  a  packet  with  an  H  character  header, 
the  overhead  is  reduced  by  a  factor  of  N  when  compared  with  the  case 
where  only  a  single  data  character  is  placed  in  each  packet.  Now, 
however,  one  must  also  consider  the  effect  of  packetization  delay.  If 
we  wait  for  N  characters  to  arrive  before  packetizing,  we  incur  a 
packetization  delay  roughly  linear  with  N.  If  the  input  channel  or 
the  source  is  low  speed  and  there  is  a  tight  constraint  on  delay,  this 
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packetization  delay  is  significant  and  must  be  considered  when  selecting 
the  packetization  strategy. 

Such  a  situation  arises  in  practice  when  many  interactive  users 
enter  a  packet  switched  network  over  low  speed  input  lines  and  then 
share  a  high  speed  line  within  the  network,  as  is  shown  in  Figure  1. 

There  are  M  users,  each  of  which  produces  an  average  of  p 
characters  per  slot.  The  slot  is  defined  to  be  the  length  of  time  it 
takes  the  low  speed  input  line  to  transmit  a  character.  Thus  p  is 
always  between  0  and  1.  We  assume  that  the  users  are  independent  of 
each  other  and  also  that  characters  from  a  given  user  arrive 
independently.  In  each  slot,  each  user  produces  a  character  with 
probability  p.  The  high-speed  network  link  shared  by  those  users  has 
a  speed  of  r  characters/slot. 

In  some  applications,  the  initial  network  node  can  echo  characters 
typed  by  the  user,  and  the  problem  of  trading  network  efficiency 
against  delay  perceived  by  the  user  can  be  avoided.  Thus,  the  node 
will  echo  individual  characters  back  to  the  terminal,  thereby  satisfying 
the  user's  constraint  on  delay,  and  also  buffer  a  reasonable  number  of 
characters  for  inclusion  in  each  packet,  thereby  keeping  the  overhead 
due  to  the  packet  header  acceptably  low. 

In  many  applications,  however,  this  approach  is  not  feasible.  For 
reliability,  some  users  prefer  an  echo  from  the  remote  host  to  ensure 
that  the  data  was  not  altered  in  transit  through  the  network.  In  some 
cases,  the  host  echos  not  just  what  is  typed  but  also  fills  in  default 
values  for  the  user.  An  example  of  this  is  a  system  which  will  fill  in 
the  remainder  of  a  keyword  or  file  name  once  the  user  has  entered  an 


unambiguous  prefix  (e.g.,  the  user  types  "COP"  and  the  system  echos 
"COPY  FROM"  because  the  only  command  which  starts  with  COP  is  the 
COPY  command).  Another  reason  for  a  remote  echo  is  that  the  network 
node  may  be  unable  to  echo  characters  on  a  selective  basis  and  unable 
to  handle  the  load  of  echoing  all  characters.  Thus,  the  problem  of 
handling  sources  which  produce  characters  at  a  low  rate  and  which 
require  packetization  within  an  amount  of  time  comparable  to  the  inter- 
arrival  time  of  individual  characters,  is  significant. 

We  thus  consider  a  problem  where  we  are  given  a  constraint  on 
delay,  D  (in  units  of  slots),  which  includes  both  packetization  and 
network  delay.  For  simplicity,  we  consider  the  case  where  the  network 
consists  of  a  single  link  as  shown  in  Figure  1.  Other,  more  complex 
situations  can  be  modeled  by  adjusting  D  to  reflect  the  allowable  delay 
after  subtracting  off  other  delays,  e.g.  host  delay,  and  by  adjusting 
the  expression  for  network  delay  to  reflect  the  presence  of  many  network 
elements.  This  is  beyond  the  scope  of  this  paper  and  will  not  be 
considered  any  further. 

Our  objective  is  to  maximize  M,  the  number  of  users  active  at  any 
time,  subject  to  a  constraint  on  D,  the  overall  delay,  by  obtaining  the 
optimal  packetization  strategy  which  keeps  both  the  packetization  and 
network  delay  small.  Specifically,  we  seek  to  maximize  M  subject  to  a 
constraint  on  D,  the  average  per  character  delay,  where  I)  is  given  by: 


D 


D  + 


s 


where  D  ,  ,  D  ,  and  D  are  the  average  per  character  delays  due  to 

P  K  c>  W 

packetization,  service  (transmission)  and  waiting  (for  the  shared 
network  link),  respectively. 


We  develop  a  packetization  strategy  suitable  for  use  in  tightly 
constrained  situations  as  described  above;  i.e.,  in  situations  when  both 
channel  efficiency  and  time  delay  are  important.  We  prove  the  optimality 
of  this  strategy  and  offer  computational  evidence  to  show  that  our  new 
strategy  increases  channel  efficiency  significantly,  in  some  cases  over 
50%,  when  compared  with  simply  using  fixed  length  packets.  Our 
strategy  is  also  easily  implemented  and  also  offers  the  advantage  of 
small  variations  in  delay  from  packet  to  packet. 


1 1 .  Delay  and  Pa  c  k  etization  Models 

We  model  a  packetization  scheme  by  a  state  transition  diagram  as 
shown  in  Figure  2.  We  consider  only  packetization  schemes  which  form 
packets  entirely  from  characters  from  a  single  user  and  furthermore 
restrict  consideration  to  memoryless  strategies,  i.e.,  strategies  which 
consider  only  what  has  happened  since  the  last  packet  was  formed  in 
making  the  decision  when  to  form  the  next  packet. 

Figure  2  can  be  thought  of  as  the  state  of  the  buffer  corres¬ 
ponding  to  a  single  user.  The  initial  state  (at  the  top  of  the  diagram) 
corresponds  to  the  buffer  being  empty  immediately  after  a  packet  is 
formed.  Transitions  in  the  diagram  correspond  to  state  changes  over 
a  single  slot  of  width  A.  With  probability  p,  a  character  arrives  and  a 
transition  to  the  left  to  a  shaded  node  occurs.  With  probability  1-p,  no 
character  arrives  and  a  transition  to  the  right  occurs.  If  a  packet  is 
formeu,  as  at  node  j  in  Figure  2,  the  transition  is  upward  to  a  state 
immediately  following  the  formation  of  a  packet. 

Each  state,  i,  has  associated  with  it  n.,  the  number  of  characters 
in  the  buffer;  d]  the  aggregate  packetization  delay  suffered  by  all  these 
characters;  and  r. ,  the  probability  of  reaching  state  i.  The  initial 
state,  0,  has  n^=0,  d^=0,  and  r^l .  Figure  3  shows  the  general 

situation  for  a  downward  transition  (i.e.,  from  a  state  in  which  a 
packet  is  not  formed.  Note  that  the  relations  between  the  quantities  at 
nodes  j,  k  and  1  are  not  functions  of  whether  or  not  a  character 
arrived  at  node  j. 

The  packetization  rule  can  be  thought  of  as  choosing  which  nodes 
in  this  tree  to  packetize  at.  Indeed,  any  memoryless  rule  which  forms 
single  user  packets  can  be  so  represented. 


FIGURE  2 


STATE  TRANSITION  DIAGRAM 
CORRESPONDING  TO  A  PACKETIZATION  SCHEME 


FIGURE  3 

INCREMENTAL  EFFECT  OF  NOT  PACKETIZING 
AT  NODE  j 


nk=nj+1 

d.  =d+n. 
k  J  ] 

rk=P'ri 


nfnj 

d,=d.+n. 

1  J  J 

r,=(l-p)  *r. 


A .  Packetization  Delay 

If  the  packetization  scheme  results  in  a  packet  length  distribution 


of 


C 


f.  =  Prob  {Packet  length  is  i  characters} 


then 


the  average  per  character  packetization  delay,  is  given  by 


I 

i  y 


f.  d. 
1  1 


where  d.  is  the  total  average  packetization  delay  suffered  by  all 
characters  in  a  packet  of  length  i  and  y  is  the  average  packet  length 
which  is  given  by 


y  =  1  i  fj 

i 


Since  characters  arrive  independently  at  a  rate  of  p  characters  per 

slot,  the  average  interarrival  time  between  characters  is  -  slots  and  d. 

P  i 

obeys  the  recurrence  relation 


d. 

i 


+ 


iil 

p 


This,  together  with  the  initial  condition  8^=0  yields 


d  = 

i 


iCi-1 


2p 


Thus,  for  example,  if  the  scheme  resulted  in  a  2-point  distribution  with 

f  =l-q  and  f  =q  then 
n  n+1  M 
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1 


D 


=  n+q 


(l-q)n(n-l)  +  q(n+l)n 
pk  (n+q )2p 


=  Yzl  +  qo-q) 

2p  2py 

This  function  is  drawn  in  Figure  4.  As  can  be  seen,  the  function  is 

continuous  and  piecewise-differentiable .  It  behaves  roughly  like  the 

linear  function^—  and  indeed  matches  that  function  for  q=0  and  q=l  . 
2p 

The  difference  between  the  two  functions  is  greatest  near  q=n  ( ( 1+-  )  2-I  ] 
This  difference  decreases  as  y  increases. 

B .  Service  Time 

The  average  service  time  per  character,  D  ,  is  given  by 

s 


i2f. 

D  =  -  (I  — -  )  +  H 

s  r  .  - 

i  y 


where  r  is  the  speed  of  the  high-speed  lines  in  characters/slot  and  H  is 
the  number  of  characters  in  the  packet  header.  For  the  2-point 
distribution  above. 


D  =  -  [  y  +  H  +  j 

s  7  - 

r  y 


Thus  we  see  that  D  behaves  in  a  manner  very  similar  to  D  .  .  It  is 

s  1  pk 

continuous,  piecewise-differentiable  and  follows  a  linear  function  of 
very  closely,  the  difference  getting  smaller  as  y  increases. 


M 


H 


*<  i 


G.  Waiting  Time 

The  exact  relationship  between  the  character  arrival  rate,  packet¬ 
ization  strategy  and  D^,  the  average  per  character  waiting  time  for 
the  high  speed  channel,  is  complex.  W’e  believe  a  reasonable  approxi¬ 
mation  to  this  delay  is  given  by  approximating  D  by  the  waiting  time 
in  an  M/G/l  queuing  system.  Since  the  character  arrival  process  is 
Poisson,  this  seems  reasonable.  Thus,  is  approximated  by 

n  =  ntAx7 
w  2(l-p) 

where  M  is  the  number  of  users 

A  is  the  average  arrival  rate  per  user  in  packets  per  slot 
x7  is  the  second  moment  of  the  service  time  for  a  packet 
and  p  is  the  average  channel  utilization. 

For  the  system  at  hand, 

A  =  E 

y 

p  =  (  Ylh) 

r  y 

f  .(i+H  )2 

P  =  2  - 2  — 

r2 

1 

For  the  2-point  distribution  above, 

x5  =  Ky+H)2  +  q(l-q)] 

and 
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2r2  (1_  rn  y±h 

r  y 


D .  Total  Delay 

From  the  above,  we  have  a  general  expression  for  D,  the  average 
per  character  delay  in  slots, 


2  f  Kill)  ,  i2/y+H  t  MA(i-*-Hj2 
i  1  2py  r  2r2(l-p) 


-  _  pM[(y+H)2  +  q(l-q)] 
'  2r2lv  - 


D 

w 


I 


# 


III.  Derivation  of  the  Opti mum  Packe tiza t ion  Strategy 

We  now  consider  the  problem  of  maximizing  M,  the  number  of  * 

users,  subject  to  a  constraint,  D*,  on  the  average  per  character  total 

delay.  We  thus  seek  the  optimal  value  of  the  packet  length,  y,  and  the 

optimal  packetization  scheme  which  will  yield  this  optimal  length. 

Given  M  and  y,  we  see  from  equation  (*)  that  the  packetization 

scheme  that  minimizes  D  is  one  which  minimizes  the  second  moment  of 

the  packet  length,  Ii2f..  We  are  constrained,  of  course,  to  packet  * 

i 

length  distributions  where  the  packet  length  only  takes  integer  values. 

It  is  easy  to  see  that  the  distribution  which  minimizes  the  second 
moment  subject  to  these  constraints  is  the  2-point  distribution  which  we 
have  been  discussing;  i.e., 

1-q  i  =  n 

f.  =  q  i  =  n+1  ’•* 

1 

0  otherwise 

where  y  =  n+q.  To  see  this,  we  use  the  technique  of  Lagrange 
multipliers  ( 9  j  to  find  the  minimum  of 

Ii2f.  +  a  lif,  +  Bit.  where 
i  .1  i 

i  i  i 

a  and  p  are  the  Lagrange  multipliers.  The  last  two  terms  are  used  to 

satisfy  the  constraints  lif.  =  y  and  If.  =  1.  In  addition,  all  f  ?  0. 

l  1  i  i 

Differentiating  with  respect  to  the  f.  we  get 

>  .# 

=  0  for  all  i  such  that  f.  >  0 

i2  +  ui  +  p  1 

?  0  for  all  i  such  that  f .  =  0 

i 

# 

Thus  there  are  at  most  two  non-zero  values  of  f  that  can  satisfy  the 

l 
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equality  condition  above.  To  satisfy  the  inequality  condition  these  two 
values  of  i  must  differ  by  one. 

Thus  we  see  that  for  a  given  value  of  y  and  M,  the  total  delay  is 
minimized  by  any  packetization  scheme  which  gives  rise  to  a  2-point 
packet  length  distribution. 

We  see  from  this  that  we  need  only  consider  2-point  packet  length 
distributions  in  order  to  obtain  an  optimal  packetization  strategy  since 
minimizing  delay  for  a  given  number  of  users  and  maximizing  the 
number  of  users  for  a  given  delay  are  complementary  problems.  The 
question  of  how  to  find  the  average  packet  length,  y,  still  remains  and 
will  be  discussed  below. 

A  similar  result,  proving  that  a  2-point  packet  length  distribution 
is  optimal,  can  also  be  obtained  by  approaching  the  problem  in  a 
different  way  by  considering  packetization  delay  and  service  time 
together.  Any  packetization  scheme  can  be  thought  of  as  a  decision 
rule  which,  given  the  state  of  the  buffers,  decides  whether  to  packetize 
or  not.  Figure  2  shows  the  state  space  for  this  decision  process. 

Tracing  a  path  from  the  root  to  any  node,  j,  one  can  determine 
the  probability,  r . ,  of  reaching  that  state  by  multiplying  all  of  the  link 
probabilities.  One  can  also  find  the  number  of  characters,  n.,  in  the 
buffer  by  counting  the  number  of  shaded  nodes  in  the  path  from  the 
root  to  node  j.  Finally,  one  can  determine  d.,  the  total  delay  due  to 
packetization  and  service  time  suffered  by  all  characters  in  the  buffer, 
if  a  packet  is  formed  at  node  j.  This  is  done  by  adding  together  the 
delays  of  all  the  characters  in  the  packet. 

Let  x.  be  a  decision  variable  taking  the  value  1  if  we  packetize  at 
node  j  and  zero  if  we  do  not.  Any  packetization  scheme  can  be  defined 
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solely  in  terms  of  the  x . .  Furthermore, 


y  =  I  x.  r.  n. 

i  J  J  J 


D,  =  I  x.  r.  d. 

j  J  '  1 


where  is  the  average  packetization  and  service  delay  for  all 

characters.  We  seek  to  maximize  y  subject  to  the  constraint  D,/y  s  D*. 
There  is  also  an  additional  constraint  that  the  packetization  scheme  be 
consistent.  In  particular,  x.  =  1  implies  that  x^  =  0  for  all  successors 
k  of  node  j  since  once  we  packetize  at  node  j  we  will  never  reach  node 


It  is  somewhat  easier  to  see  what  the  optimal  policy  is  if  we 

consider  the  decision  process  in  a  slightly  different  way.  We  define  a 

new  set  of  decision  variables,  w.,  associated  with  the  nodes  j.  The  w 

1  1 

are  related  to  the  x.  above  by  the  rule  that  if  x.  =  1  then  w  =  1  for  all 

J  1  i 

predecessors  of  j  and  w.  =  0  otherwise.  For  all  consistent  values  of  x., 
the  w  are  well  defined.  Setting  w.  =  1  can  be  thought  of  as  deciding 
not  to  packetize  at  node  i. 

We  now  define  g.  as  the  increment  in  packet  length  due  to  the 

decision  not  to  packetize  at  node  j.  Similarly  we  define  h.  as  the 

increment  in  the  average  total  delay,  Dj. 

The  expressions  for  y  and  Dj  can  then  be  rewritten  in  terms  of 

the  new  decision  variables,  w.,  as  follows: 

J 


1  w.  g . 

i  J  1 


D,  =  1  w.  h. 

j  J  J 
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The  incremental  effect  of  not  packetizing  at  node  j  is  shown  in 

Fiqure  3.  As  can  be  seen  if  w.  =  1,  two  nodes,  k  and  i,  will  be 

) 

counted  instead  of  node  i  in  the  sums  for  y  and  Dj.  Thus: 


g.  =  (l-q)r. 


2n.  +  H  +  1 

(n.  +  — - )r. 

J  c  ) 


In  order  to  maximize  y  for  a  given  D  we  would  prefer  to  choose 

nodes  j  with  maximal  g./h.,  i.e.  nodes  which  maximize  the  incremental 

gain  in  y  per  unit  Dx.  Note  that  in  reality  it  is  Dj/y  not  Dj  which  is 

the  constraint  but  since  we  seek  to  maximize  y  we  are  simultaneously 

loosening  the  constraint  as  much  as  possible. 

In  order  to  maximize  g./h.  we  choose  nodes  i  with  the  smallest 

J  1  1 

possible  n.  and  set  w.  =  1  for  those  nodes.  We  continue  to  pick  nodes 

until  we  find  that  setting  the  next  w.  =  1  would  cause  Dj./y  to  exceed 

D*.  Until  this  point,  we  have  assumed  for  the  sake  of  simplicity  that 

w.  would  only  take  the  values  0  or  1 .  In  fact  we  are  free  to  assign 

any  value  in  the  range  C  to  1  to  each  w  .  At  the  point  where  setting 

w.  =  1  would  violate  the  constraint  on  D1(  there  exists  a  value,  a,  of 

w.  which  would  cause  the  constraint  to  be  satisfied  exactly,  since  D*y 

is  a  monotone  function  of  w..  Thus,  we  set  w.  =  a  for  this  last  j  and 

J  J 

satisfy  the  constraint  with  equality. 

It  is  clear  that  this  procedure  will  indeed  find  an  optimal  set  ot 
values  for  the  w  in  the  sense  that  y  is  maximized.  To  see  this  we 
need  only  note  that  in  order  to  increase  the  value  of  any  w.  above  the 
value  set  by  this  procedure  we  would  have  to  correspondingly  decrease 
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the  value  of  some  other  w^  with  a  resulting  decrease  (or  at  best  no 
change)  in  y.  This  is  because  we  are  selecting  the  nodes,  j,  in  order 
of  g./h.,  largest  first. 

Note  that  at  any  stage  there  is  an  infinite  number  of  nodes  j  with 

the  same  value  of  n.  to  choose  from.  Any  selection  will  yield  exactly 

the  same  value  of  y.  Indeed,  one  need  only  ask  if  w.  can  be  set  to  1 

J 

for  all  nodes  with  a  particular  n..  If  so,  we  do  this  before  asking  the 

question  for  the  next  larger  value  of  n . .  Thus  we  avoid  any  conceptual 

difficulty  with  the  finiteness  of  the  decision  process. 

We  are  left  then  only  with  the  question  the  consistency  of  the 

packetization  policy.  This  too,  however,  is  no  problem.  Note  that  if 

we  set  w.  =  1  if  n.  <  n*  and  also  for  a  collection  of  nodes  with  n.  =  n* 
)  J  ] 

and  with  aggregate  probability  1-q  then  we  have  a  consistent 
packetization  policy  which  adheres  to  the  optimization  criteria.  This 
policy  can  be  simply  stated,  and  equivalently  implemented,  as  waiting 
until  the  n*-—  character  arrives,  flipping  a  coin  which  comes  up  heads 
with  probability  q,  packetizing  at  that  point  if  the  coin  comes  up  tails, 
and  packetizing  after  the  next  character  arrives  otherwise. 

We  have  thus  shown,  again,  that  a  2-point  distribution  minimizes 
delay  due  to  packetization  and  service  time.  Given  M,  the  number  of 
users,  this  also  minimizes  queuing  delay,  as  shown  above.  We  now 
turn  to  the  final  question,  how  to  determine  y,  the  optimal  average 
packet  length,  in  practice. 
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IV.  Computation  j3f_the_Oj3timal  Packet  Length 

Examination  of  the  expressions  for  the  components  of  delay  given 
in  Section  II  shows  that  grossly,  packetization  delay  and  service  time 
grow  linearly  with  packet  length  while  queuing  packetization  delay 
decreases  as  the  packet  length  increases  (since  the  overhead  due  to  the 

header,  which  contributes  to  the  utilization,  decreases).  There  is, 

however,  a  term  proportional  to  q(l-q)  (where  q  is  the  fractional  part 
of  the  packet  length)  present  in  each  delay  component.  This  term 

causes  the  derivative  of  delay  with  respect  to  packet  length  to  be 
discontinuous  and  prevents  us  from  simply  doing  a  gradient  search  to 
find  the  optimal  value  of  y.  As  can  be  seen  by  examining  Figure  2, 
however,  one  might  expect  a  simple  search  to  yield  a  reasonable  approx¬ 
imation  to  y  since  the  q(l-q)  term  is  small  and  becomes  even  less 

significant  as  y  increases.  We  are  thus  led  to  the  following  approach. 

Ignoring  the  q(l-q)  terms  in  I),  we  can  find  y,  an  approximate 
value  of  y,  using  a  gradient  search  or  other  simple  search  procedure. 
"Thus  we  let 


Mp 


(y+H)2 


D  = 


Y-1 

2p 


+  1 


(y+H  > 


2r2  [  1  -MP(y+H) 
r  y 


and 


dl)  1^1 

.  —  +  + 
dy  2p  r 

(liven  M,  we  can  then  minimize 
the  derivative  is  0.  Given  It, 
since  I)  is  monotone  with  M;  i.e. 


mp.  y*")i(]-M£  ><>'*">-  2I( 
y  r  y  y 


2r2  \]-M?  ^+,1h  " 


r 


I)  by  finding  y,  the  value 
the  value  of  M  can  then 
if  1)  is  less  (greater)  than 


of  y ,  when 
be  ad]..,  iC'i 
t he  ;  equir  rd 
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delay,  then  M  can  be  increased  (decreased).  A  binary  search  on  M  can 
thus  be  performed.  A  simple  initial  upper  limit  on  M  for  this  purpose 
is  M=r/p.  Given  these  approximate  values  of  y  and  M,  it  is  then  possible 
to  proceed  in  practice  using  the  real  values  of  D  in^uding  the  g(l-q) 
terms.  The  derivative  is  now  only  continuous  over  individual  integral 
values  of  y  and  so,  strictly  speaking,  a  separate  search  must  be  per¬ 
formed  over  each  such  interval.  In  practice,  at  most  2  such  intervals 
would  have  to  be  searched  once  the  above  approximate  values  were 
determined  since  the  q(l-q>  terms  do  not  alter  the  result  much.  This 
is  verified  empirically  by  the  computational  experience  presented  below. 


•4 


9 


■  9 
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Computational  Experience 


C 


t 


We  now  examine  the  effect  in  practice  of  using  this  optimal  packet¬ 
ization  scheme  rather  than  simply  forming  fixed  length  packets.  This 
issue,  along  with  a  comparison  with  other  strategies,  is  examined  in 
more  detail  in  [1].  A  simple  optimization  procedure  was  implemented  for 
this  purpose,  and  is  described  below. 

The  search  procedure  used  is  shown  in  Figure  5.  The  variables  y 
and  M  are  the  current  values  of  the  packet  length  and  number  of  users 
being  examined,  respectively.  The  variables  y*  and  M*  are  the  current 
estimates  of  the  optimal  values  of  y  and  M.  Dre(^  is  the  delay  require¬ 
ment  in  slots.  The  variables  A  and  T  control  the  length  and  precision 
of  the  search  and  were  set  to  .01  and  2,  respectively.  We  are  interested 
primarily  in  cases  where  y  is  small  and  so  this  simple  linear  search  is 
quite  reasonable.  For  large  y  the  procedure  described  in  the  preceding 
section  could  be  implemented  or,  more  simply,  larger  values  of  A  and  T 
could  be  used  with  this  procedure  since  small  differences  in  y  would 
not  alter  M. 

We  examined  problems  with  p,  the  number  of  characters  per  user 
per  slot,  of  **  and  r,  the  trasmission  speed  of  the  high-speed  channel 
in  characters  per  slot,  of  300.  This  corresponds  to  users  producing 
characters  at  an  average  rate  of  1  per  second  on  a  low-speed  channel 
with  peak  rate  4  characters  per  second  and  sharing  a  high-speed 
channel  of  1200  characters  per  second.  Values  of  H,  the  packet  header 
size,  of  1,  6,  and  32  were  examined. 
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TAB1.1.  1 


COMPARISON  BFTWFFN  OPTIMAL  AM)  ITXFD  SCHFMFS 


req(.slots ) 

Y* 

M* 

^1  IXCfj 

M.  ,  % 

1  ixed 

difference 

.  1 

1  . 00 

579 

1 

579 

0.0 

.2 

1  .01 

590 

1 

589 

0.2 

.5 

1 .09 

612 

1 

595 

2.9 

1 .0 

1 .26 

653 

1 

597 

9.4 

1.5 

1 .50 

701 

1 

598 

17.2 

2.0 

1.83 

755 

1 

598 

26 . 2 

2.25 

2.00 

783 

2 

783 

0.0 

3 

2.26 

816 

2 

795 

2.6 

4 

2.77 

864 

2 

797 

8.4 

4.25 

3.00 

875 

3 

875 

0.0 

5 

3.21 

901 

3 

893 

.9 

6 

3.70 

929 

3 

896 

3.7 

6.5 

4.00 

943 

4 

943 

0.0 

7 

4.13 

953 

4 

952 

0.1 

10. 

5.53 

1003 

5 

994 

0.9 

Table  la  .  01=1  ) 


w 

* 

y 

fixed 

;  j 

%difference 

rixea 

14 

8 

i 

148 

0.0 

17 

2 

i 

167 

2.9 

19 

4 

i 

169 

14.8 

25 

9 

i 

170 

52.3 

29 

3 

2 

291 

0.7 

36 

1 

2 

297 

21.5 

38 

7 

3 

387 

0.0 

56 

3 

5 

540 

4.3 

Table  lb. 

(H=6) 

M* 

yfixed 

M,.  , 

fixed 

%differenre 

22 

1 

22 

0.0 

36 

1 

34 

5.8 

49 

1 

35 

40.0 

67 

2 

66 

1.5 

63 

5 

157 

3.8 

Table  lc. 

(H=32) 

FIGURE  6  NUMBER  OE  USERS 


Table  1  shows  the  results  of  a  comparison  between  the  optimal 
scheme  and  fixed  length  packets.  The  column  labeled  "%-difference" 
shows  the  percentage  improvement  in  M  using  optimal  packet  lengths 
(y*)  rather  than  the  best  fixed  length  (yfjxecj)-  Figure  6  shows  this 
comparison  for  H=6.  As  can  be  seen,  the  improvement  is  guite 

significant  in  some  cases.  In  particular,  this  difference  is  most 
significant  for  small  delay  and  large  H. 

Note  that  the  difference  in  performance  is  not  monotone.  In 

particular,  as  the  allowable  delay  increases,  y*  increases.  As  y*  takes 
a  larger  fractional  part,  M*  increases.  The  fixed  length  scheme, 

however,  cannot  do  this  and  so  Mfixecj  increases  only  very  slightly. 
The  rise  in  M*  with  increasing  y*  is  due  to  the  decrease  in  header 
overhead  as  the  packet  length  increases.  The  small  increase  in  Mfjxecj 
is  due  to  the  slight  increase  in  utilization  permitted  by  increasing  the 
allowable  delay. 

The  column  labeled  in  Table  1  and  shown  in  Figure  1  is  an 

upper  bound  on  M  computed  from  the  simple  observations  that  the 

packetization  delay  must  be  smaller  than  the  total  allowable  delay,  D, 

and  the  packetization  delay  is  at  least  .  This  puts  an  upper  bound 

zp 

on  y : 


y  §  2p  D  +  1 


An  upper  bound  on  M  is  then  obtained  by  observing  that  the 
utilization  of  the  high  speed  channel  is  bounded  by  1: 


Ms  *  -X- 

p  y+H 
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Thus 


♦ 


* 


<  r  2pD+l_ 

~  p  2pD+l+H 

As  we  can  see,  this  bound  is  tightest  for  large  D  and  small  H.  It  is 
sometimes  useful  in  quickly  estiating  M*  to  compare  it  with  ^  which 

is  easily  estimated  by  equations  2  and  3  and  restricting  y  to  integers. 


f 
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VI .  Conclusions 

We  have  presented  a  packetization  strategy  along  with  a  proof  of 
its  optimality.  The  technique  is  easily  implemented  in  practice.  A 
comparison  with  fixed  length  packets  is  presented  and  the  optimal 
technique  exhibits  significantly  higher  throughputs,  in  some  cases  over 
50%  higher.  We  conclude  therefore  that  the  packetization  strategy 
should  be  used,  in  practice,  especially  when  the  delay  constraint  is 
tight  and  the  header  size  is  large. 
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Optimal  Fixed  Frame  Multiplexing  in  Integrated 
Line-  and  Packet-Switched 
Communication  Networks 

BASIL  S.  MAGLARIS.  mimbir.  imp.  and  MISCHA  SCHWARTZ,  fh.i.ow.  irtp 


Abstract — Recenllv.  emphasis  has  been  placed  on  integrated  communi¬ 
cation  facilities  capable  of  handling  both  line-switching  and  packet-switch¬ 
ing  digital  traffic.  The  problem  of  dsnamicalls  allocating  the  bandwidth  of  a 
trunk  to  both  tv  pcs  of  traffic  is  formulated  as  a  Markovian  decision 
process.  Line  switching  is  modeled  as  a  lime  division  multiplexing  loss 
scheme  over  a  varxing  portion  of  a  fixed  lime  frame.  Packet-switching 
traffic  is  served  through  the  remaining  portion  of  the  frame  and  requires 
queueing  at  the  multiplexer-concentrator.  Two  different  cost  criteria  are 
examined  involving  probabilitv  of  blocking  for  line  switching  and  average 
queueing  delav  for  packets.  The  corresponding  optimization  problems  are 
presented  under  reasonable  simplifving  assumptions.  The  movable  boundary 
scheme  suggested  for  commercial  implementation  of  integrated  multi¬ 
plexers  is  shown  to  offer  optimal  or  near-optimal  performance. 

I.  Introduction 

N  ORDER  to  multiplex  and  concentrate  different 
data-sending  facilities  over  a  large  bandwidth  channel, 
a  choice  of  a  switching  technique  must  be  made.  Line 
switching  is  used  for  transmission  of  long  messages  and 
requires  a  permanent  connection  between  the  communicat¬ 
ing  ends  for  the  duration  of  the  session.  This  connection 
can  be  a  physical  line,  a  synchronous  slot  allocation  in  a 
time  division  multiplexing  scheme,  or  a  frequency  band 
allocation  in  frequency  division  mulupl  xing.  Initial  setup 
times  must  be  short  compared  to  the  message  length  for 
efficient  channel  utilization.  Most  of  the  actual 
line-switched  systems  are  loss  systems,  i.e..  a  request  for  a 
new  line  is  blocked  when  no  bandwidth  is  available,  and  a 
new  trial  must  be  made  later.  The  probability  of  blocking  a 
new  request  is  a  measure  of  the  performance  of  the  system 

For  short  interactive  types  of  traffic,  messages  arriving 
at  the  concentrator  wait  in  a  queue  (either  as  whole  mes¬ 
sages  or  divided  into  smaller  portions  called  packets)  and 
arc  served  according  to  some  discipline  A  header  in  the 
message  or  packet  is  required  containing  identification  and 
destination  information.  Routing  and  reassembly  of  mes¬ 
sages  at  the  destination  node  are  additional  problems  in 
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such  systems.  A  measure  of  the  performance  of  the  system 
is  the  average  queueing  delay  experienced  by  an  average 
message. 

Closs  in  [1]  presented  a  comparative  evaluation  of  line- 
and  packet-switching  lossless  schemes  and  showed  the  su¬ 
periority  of  the  former  for  lengthy  message  traffic  and  of 
the  latter  for  bursty  short  packet  traffic.  This  basic  result 
has  been  further  demonstrated  in  [2)-[4)  for  various  classes 
of  models. 

In  a  more  general-purpose  digital  data  communication 
network,  it  is  desirable  to  integrate  both  line-  and  packet- 
switching  facilities.  The  user  or  the  network  manager  may 
decide  on  the  most  suitable  discipline.  This  integrated 
facility  can  accommodate  simultaneously  line-switching 
traffic  such  as  voice  traffic  or  computer  batch  traffic,  and 
packet-switching  data  traffic,  such  as  bursty  terminal-to- 
host  traffic. 

Various  schemes  have  been  studied  and  implemented. 
Fixed  frame  fixed  boundary  schemes  allocate  a  pre¬ 
determined  portion  of  a  lime  frame  (consisting  of  a  given 
number  of  constant  duration  slots)  to  each  type  of  traffic. 
Fixed  frame  movable  boundary  schemes  improv  e  the  packet 
traffic  throughput  by  introducing  some  intelligence  to  the 
previous  scheme.  Packet  traffic  is  allowed  to  occupy  any 
idle  slot  of  the  line  switching  portion  of  the  frame. 
The  movable  boundary  scheme  was  first  analyzed  by 
Kummerle  [5)  and  an  implementation  research  project  has 
been  undertaken  by  IBM  Zurich  |6]-[8]  The  proposed 
architecture  is  summarized  in  [9). 

A  parallel  effort  was  motivated  by  ihe  U.S.  Depariment 
of  Defense  aimed  at  replacing  the  ALTOVON 
line-switched  voice  network  and  AUTODIN  packet- 
switched  data  network  bv  an  integrated  structure.  Various 
alternatives  have  been  considered  bv  the  Defense  Com¬ 
munications  Agency  (DCA)  and  i is  contractors.  These 
include  either  movable  boundary  protocols  1 1 0] - [  !2|  or  the 
use  of  packetized  voice  techniques  to  eliminate  line  switch¬ 
ing  [  1 3)- [  15]  Commercial  vendors  (such  as  TRAN  Cor¬ 
poration  (16]  and  Codex  Corporation  [17])  have  announced 
integrated  approaches.  The  Codex  approach  uses  variable 
frame  multiplexing  with  the  frame  size  adjusted  to  the 
traffic  variations.  Analyses  of  variable  frame  integrated 
multiplexers  can  he  found  in  (IH|  and  |lg| 

The  growing  interest  in  integrated  line-  and  packet- 
switched  networks,  as  demonstrated  m  the  foregoing  survey. 
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and  specifically  the  efforts  to  improve  the  efficiency  til 
these  systems  in  terms  of  the  trunk  capacity  utilization,  led 
to  the  study  presented  in  this  paper.  Its  aim  is  io  investi¬ 
gate  the  limit  of  the  capacity  utilization  improvement  un¬ 
der  a  basic  constraint:  fixed  frame  size  and  synchronous 
service  provision  to  line-switching  sessions  once  thev  get 
access  to  the  system.  A  loss  model  is  assumed  for  the 
line-switching  traffic  and  a  first-in- first-out  queueing 
model  for  the  fixed  size  packet  traffic.  The  problem  of 
optimal  allocation  of  frame  slots  is  formulated  as  a 
Markovian  decision  process.  Two  optimality  criteria  are 
examined.  Under  the  first,  the  average  time  delav  for  the 
packet  traffic  is  minimized  with  the  probability  of  blocking 
a  new  line-switching  arrival  constrained  to  be  below  a 
given  level.  A  linear  programming  method  is  used  to  de¬ 
termine  the  optimal  policy  under  a  birth-death  process 
model  for  the  line-switching  traffic.  Under  the  second 
criterion,  a  linear  combination  of  the  average  time  delay 
and  the  probability  of  blocking  is  minimized.  A  policy 
improvement  algorithm  is  used  to  obtain  the  optimal  poli¬ 
cies  both  under  the  same  birth-death  assumption  and  for 
the  general  case  of  multiple  arrivals  or  completions  in  a 
frame.  It  is  shown  that  under  the  birth-death  assumption, 
the  movable  boundary  scheme  is  in  some  cases  optimum  or 
close  to  optimum  in  the  examples  studied.  Similar  results 
are  obtained  for  the  general  case. 

II.  The  Model 


A.  Input  Traffic 

Line-switching  traffic  is  assumed  to  consist  of  messages 
arriving  with  a  Poisson  rate  of  X,  messages/s.  Message 
length  is  exponentially  distributed  with  mean  1  /ji  s.  A  new 
arrival  may  either  be  accepted  or  blocked.  Packet-switching 
traffic  consists  of  fixed  length  packets  arriving  at  a  Poisson 
rate  of  X2  packets/s  and  stored  in  a  finite  queue  of 
capacity  Q  packets.  Q  is  assumed  to  be  large  enough  to 
result  in  negligible  packet  overflow. 

B.  Output  Trunk 

The  incoming  traffic  of  both  types  is  multiplexed  onto  a 
main  trunk  of  capacity  C  packeis/s.  A  frame  of  b  s 
duration,  divided  into  M  slots  as  in  Fig.  1.  accepts  both  the 
line-switching  and  packet-switching  traffic.  The  slot  size  is 
chosen  to  accommodate  exactly  one  packet  Hence  C  = 
M /b.  S/  slots  in  the  / th  frame  (Fig.  I)  are  allocated  to  the 
line-switching  messages,  one  slot  per  message,  in  a  time 
division  multiplexing  mode  (every  frame  processes  a  frac¬ 
tion  of  a  message  equal  to  the  packet  or  slot  size).  The 
multiplexer  must  guarantee  no  interruption  of  synchronous 
service  to  the  message  already  in  the  system.  A  new  setup 
call  is  stored  and  considered  for  acceptance  to  the  trunk  at 
the  beginning  of  a  new  frame  The  remaining  M  -  .V  -  A 
slots  are  allocated  to  the  packets  present  in  the  packet 
queue  at  the  beginning  of  the  yth  frame,  in  a  first-in  I  n  st¬ 
out  mode.  The  object  is  to  determine  the  optimum  alloca- 
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lion  of  Sl  and  .V  to  circuit  and  packet  traffic,  respectively. 
As  noted  earlier,  two  optimality  criteria  are  investigated  for 
this  purpose. 

C.  Decision  Mechanism — Stale  of  the  System 

We  define  the  state  of  the  system  X ,  at  the  time  instant 
just  prior  to  the  opening  of  the  y  th  frame  as  follows: 

*z  = 


where 

Lj  number  of  packets  waiting  in  the  packet  queue. 
r;  number  of  messages  already  in  the  system  and  re¬ 
quiring  continuation  of  service,  and 
number  of  new  setup  calls  during  they  -  1  frame. 

The  multiplexer  decides  on  Sr  .V  based  on  ,Y(.  the  previous 
history  of  the  system,  and  some  optimizing  rule.  Continua¬ 
tion  of  service  for  the  line-switching  traffic  must  be 
guaranteed  and  any  portion  of  the  frame  not  used  by  long 
messages  must  be  allocated  to  packet-switching  traffic. 
Hence  r  <  S,  <  min  ( r(  +  »r  M ). 

The  transition  probabilities  from  a  present  state  Xt  to  a 
new  state  X  ,  depend  only  on  X,  and  the  number  of  slots 
allocated  to  the  line-switching  traffic  of  the y  th  frame  A',.  In 
order  to  derive  them,  we  need  the  probability  distributions 
of  the  components  of  state  A"( .  , .  These  are  given  b\  ( I )  - 1 3 > 
below  and  are  obtained  bv  noting  the  following 

vt  .  |  Number  of  new  setup  calls  during  the  /th  frame. 
The  number  follows  a  Poisson  distribution  »nh 
mean  X,/>.  (Equation  (I)  follows  directly.) 
r.,  Number  of  messages  requiring  continuation  of 
service  among  the  S;  messages  currently  in  the 
system  Since  we  assumed  that  the  message  lengths 
were  exponential,  r  . ,  depends  only  on  Sr  (This  is 
due  to  the  memorylexs  property  of  the  exponential 
distribution.)  The  binomial  distribution  of  (2)  fol¬ 
lows  directly.  (The  probability  of  a  completion  in 
a  frame  b  s  long  is  I  -  c  “A) 

Lt .  j  Depends  on  l.t  and  .V  according  to  the  transition 
equations  of  a  finite  queue  with  Poisson  arrivals 
\,b.  constant  service  b.  and  ,V  -  M  -  Sl  servers. 
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This  leads  lo  (3). 

-  ~7^~t 


A  th 


Fsfr+i)  =  rsfr.  il*,) 


(I) 


(2) 


if  Ly  <  Nt  =  A/ 


*r 


(A,6) 


(3) 


(£/+1-L,  +  A/)! 

if  Afy  <£,<£,_,+  ;V , 

0,  otherwise 

for  all  LJt,  <  Q.  Ly  <  £).  and  PS(Q  \  Lt)  =  1  - 
2?='o^,(^*i  ~  k\Lj)-  (Recall  that  L/tl'<  0.) 

Since  each  of  the  i>  , ,  r  ,,  Z./+l  random  variables  de¬ 
pends  only  on  the  previous  combination,  we  i  onclude  that 
the  slate  transition  probabilities  are  given  by 

/’s((*,+  ,l*,)  =  />(*, ml*,-5,) 

=  )/>s,('}*i)/s,(/'/-il/'/)-  (4) 


Depending  on  the  length  of  the  frame  ft  and  the  arrival  rate 
A,,  we  may  define,  within  any  acceptable  level  of  accuracy, 
the  maximum  number  of  new  calls  per  frame  CMAX  such 
that  P(vt  >  CMAX)  r?  0.  Thus  t>r  r;,  L/  may  assume  a  finite 
number  of  values  each,  and  the  process  has  a  finite  state 
space. 

The  transitions  (4)  define  a  finite  slate,  discrete  time 
Markov  process  with  a  decision  mechanism  controlling  the 
evolution  of  the  process.  Note  that  the  vector  state  rep¬ 
resentation  does  not  lead  to  the  erroneous  implicit  inde¬ 
pendence  of  its  components  as  in  ( 20).  reported  in  [21 J.  The 
theory  of  Markovian  decision  processes  (see  [22])  provides 
us  with  the  analytical  and  computational  tools  for  the 
evaluation  of  an  optimal  policy  r.  i.e..  an  assignment  rule, 
possibly  probabilistic,  of  an  action  5,  (the  number  of  slots 
to  be  assigned  to  line-switched  traffic)  to  a  slate  X/  based 
on  the  history  of  the  process 


III.  First  Optimality  Critfrion 

The  first  optimality  criterion  we  consider  is  that  of 
minimizing  the  average  packet  queueing  delay  TJ  with  the 
probability  PI  of  blocking  newly  arriving  line-switching 
calls  constrained  to  be  no  more  than  a  specified  acceptable 
level  PI. OSS. 

By  Little's  formula  |23]  the  packet  delay  is  proportional 
to  the  average  packet  queue  size  /.(  I  ).  The  policy  of 
assignment  rule  w  that  minimizes  /.(/.)  is  identical  with 
one  that  mininn/es  Td.  and  thus  we  focus  on  /;( I.)  only 
PI.  henceforth  called  loss  probability,  mav  also  he  written 


in  terms  of  expectations  of  the  various  parameters  previ¬ 
ously  determined  in  Section  II  (  he  number  of  luu  switched 
calls  turned  away  in  a  frame  is  z  »  r  .S' .  the  excess  ol 
calls  over  the  slots  made  available.  PI  is  the  ratio  ol  the 
average  of  this  quantity  to  the  average  number  of  call'  in  a 
frame.  \{b.  Using  subscript  it  to  denote  policy  w.  PI  is 
given  by 


„  -  £-<r  *  r  -  5-> 

A,* 

The  optimality  criterion  is  thus  formulated  as 
mjn  (£„(L  )}. 


(5) 


given  the  transitions  (4)  with 

EJr,  +  »,-  S,) 

- - —  <  PLOSS.  (6) 

A,o 

Here  PLOSS  is  the  maximum  permitted  fraction  of  lost 
calls.  The  transition  probabilities  (4)  and  the  criterion  (6) 
define  a  finite  state  decision  process  with  minimization  of 
the  average  expected  cost  £„(  Lt)  and  a  constraint  on  the 
average  linear  action-state  combination  Ejrt  -1-  r  -  St ).  It 
turns  out  (see  [24])  that  if  rr  is  restricted  ;o  the  irreducibiliiy 
class  {rr}„.,  as  defined  in  the  Appendix,  then  there  exists  a 
stationary  policy,  assigning  actions  to  states  independently 
of  the  past  history,  which  satisfies  the  optimality  criterion 
of  (6).  It  is  shown  in  [24]  and  [25]  that  the  inequality 
constraint  in  (6)  leads,  in  general,  to  a  possibly  probabilis¬ 
tic  (i.e..  nondeterministic)  assignment  rule  for  the  optimum 
policy. 

A  considerable  simplication  of  the  problem  arises  if  we 
assume  the  frame  duration  b  is  very  short  compared  to  the 
average  message  length  1  //z.  (This  corresponds  to  the  case 
of  a  high-speed  trunk.)  This  assumption  is  identical  to  the 
one  used  in  the  Erlang-#  approximation  for  synchronous 
line-switching  traffic,  and  it  is  shown  to  be  valid  for  a  wide 
variety  of  applications  in  [26).  Linder  this  assumption,  the 
number  of  line-switching  messages  in  t hi  system  mav  not 
change  by  more  than  one  from  frame  to  frame.  This 
corresponds  to  a  birth-death  process  model  for  the  line- 
switching  traffic  There  are  thus  onlv  two  possible  actions 
for  the  optimal  policy  —  to  block  a  new  call  or  to  allocate  a 
slot  to  it.  The  three-dimensional  vectoi  A  is  then  reduced 
to  a  two-dimensional  vector  A’(  -  (  Lr  /,).  and  the  possible 
actions  may  be  written  as 

A(Lr  r; )  =  0.  if  a  new  call  is  blocked. 

•((  Z.(.  r  )  =  I.  if  a  new  call  is  given  service. 

Obviously,  a  decision  must  he  made  onlv  if  a  new  arrival 
occurs  i with  probability  A,/>!  I  hc  boundary  .V  mav.  there¬ 
fore.  either  accommodate  the  z  sessions  already  in  the 
system  or  increase  by  one  in  order  to  allocate  a  newly 
arrived  session 
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Note  that  the  probahihtv  of  loss  I'l.  will  in  this  case  he 

-S,)  rt[A(L,.rl)  <»)/•(  r,  I) 

F •  \th  ~  '  A,b 

=  Pm{A[L,.rt)=  0).  (5a) 

Under  the  birth-death  approximation,  therefore.  PIT  is  the 
actual  probability  of  loss  (or  probability  of  blocking)  for 
the  line-switched  traffic.  This  justifies  the  use  of  the  term 
probability  of  loss  for  the  average  fraction  of  (5). 

The  problem  of  minimizing  the  average  packet  queue 
length  given  the  inequality  constraint  on  the  probability  of 
blocking  new  setup  calls  can  now  he  formulated  (22.  p. 
132]  as  the  following  linear  program. 

For  every  class  of  policies  («}„ .  X,/jt  <  M’  ■£  M  (see 
the  Appendix)  determine  the  nonnegative  variables 
P(L,  r.  A)  (the  joint  probabilities  of  state  ( L.  r)  and  ac¬ 
tion  A  €  {0. 1 )  defining  the  probabilistic  state  action  as¬ 
signment)  for  0  <  L  S  Q.  0  S  r  <  A/'.  in  order  to  mini¬ 
mize 

E(L)=  2  2  2  LP(L.r.i) 

i*=0  r*0  /.* 0 

under  the  probability  of  blocking  constraint 

%  P(L,  r.  A  =  0)  S  PLOSS.  (7) 

t..r 

the  linear  transition  constraints,  and  the  probability  mea¬ 
sure  constraint  (all  probabilities  are  nonnegative  and  sum 
to  one). 

The  linear  programming  algorithm  may  be  used  to  com¬ 
pute  different  time  delay  probability  of  blocking  optimal 
pairs.  Apart  from  the  computational  complexity,  the  main 
disadvantage  of  the  method  is  the  nondeierminislic  struc¬ 
ture  of  the  resulting  policies  due  to  the  probability  of 
blocking  constraint.  It  is  thus  desirable  to  introduce  another 
optimality  criterion  leading  to  deterministic  policies.  This 
is  done  in  the  next  section.  Computational  results  for  both 
criteria  are  presented  in  Section  V. 

IV.  Second  Optimality  Criterion 

The  second  optimality  criterion,  which  leads  to  de¬ 
terministic  policies,  simply  minimizes  the  weighted  sum  of 
the  average  packet  queue  size  and  the  probability  of  loss 
for  line-switched  calls.  Specifically,  we  choose  to  minimize 

*  =  £.(L)  +  «P/W.  (8) 

By  letting  the  weight  factor  a  vary  (0  S  u  s  x).  different 
optimal  pairs  (  En(L).  Plr )  may  be  obtained  that  minimize 
%.  It  can  be  shown  |27|  that  these  pairs  correspond  to 
points  on  the  optimal  average  queue  length  loss  probabil¬ 
ity  curve  (the  first  criterion)  as  well.  This  second  criterion 
thus  also  serves  as  a  way  of  computing  some  points  on  the 
curve  for  the  first.  By  incorporating  the  previous  constraint 
on  loss  probability  in  the  average  eivst  function,  the  prob¬ 
lem  turns  oul  lo  be  identical  to  a  finite  state  Markov 
decision  pntcess  with  average  cost  minimization.  Ii  follows 
|22|  that  for  every  irreducible  class  of  policies  (rr),,  an 


optimal  potiev  exists,  and  it  is  deterministic,  i.e..  it  has  the 
form  of  a  fixed  assignment  of  an  action  to  every  stale.  We 
first  consider  the  birth  death  approximate  model  of  Sec¬ 
tion  III.  then  take  the  exact  model  with  no  such  approxi¬ 
mation  made. 

A.  The  Birth -Death  Approximation 
Let  us  define  the  one-period  cost  C(X.  S )  as  follows: 
CiX.S)  =  C(L.  r.A)  i  L  +  a(l  -  A). 

_  Jo.  if  ( L.  r)  is  a  blocking  state. 

[  I .  if  ( L,  r)  is  an  allowing  state. 

Then  (see  (22.  p.  141]).  the  average  cost  g  will  he  indepen¬ 
dent  of  the  initial  state  ( L,,.  r0)  =  X0  and  given  by 

«-■*  „  n  *■  t 


For  every  class  of  policies  (*}M .  the  transitions  (4)  and  the 
average  expected  cost  (8)  define  a  finite  state  Markov 
decision  process  with  average  cost  minimization  and  valid 
irreducibility  condition.  Therefore,  from  (22.  theorem  6.17] 
we  conclude  that  an  optimal  policy  it  exists  within  every 
class  {tr}Af.  and  it  is  stationary  deterministic.  Equivalently, 
there  exists  a  set  of  bounded  functions  h(A C)  =  h(  /..  r), 
0  <  r  <  M\0  <  L  <  Q  and  a  fixed  number  g  such  that 

g  -  h(L,r) 

=  min  { C(L.r.A) 

AC  (O.l)  \ 

+  2  /»( L\  r’ |  L.r.A)h(L\  r')).  (9) 

t:.r  • 


where 


C(  L.  r.  A)  =  L  +  a(l  -  A) 


P(L\  r’  |  L.  r,  A) 

f  P,(L'\L)P'(r’). 

/*,.<(  L'  |  L  )Pr,  <(/•'), 


if  no  arrivals  occur, 
if  one  arrival  occurs. 


Again.  /’,(/.' )  L)  is  given  by  (3).  while  under  the  birth- 
death  assumption 

probability  of  no  arrivals  =  I  -  A, A. 

probability  of  one  arrival  =  A, A, 

phs.  if  r’  =  j  -  I  (one  departure). 

P\>  ')  =  1  -  A.r.  if  r'  =  s  (no departure). 

0.  otherwise. 

The  minimization  of  the  right  side  of  (V)  is  carried  out 
under  two  distinct  options,  a)  the  state  t  /..  r )  blocks  a  new 
arrival,  or  b)  u  allows  a  new  arrival  in  the  system.  Thus(9| 
defines  the  deterministic  optima!  policy  within  the  class 
{*!„.  Hie  global  optimal  policy  must  be  selected  among 
the  optimal  policies  of  .ill  classes,  with  A,  p  <  A/ "  ■-  SI. 
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F.quation  (4)  cun  he  used  to  carrv  out  u  search  algorithm 
for  the  iletcrinmution  of  the  opnnut  police  1  ho  aleoi  ithm. 
referred  to  as  the  police  improvement  algorithm  (see  |22.  p 
I50|)  sturt.s  with  un  arbitrary  miii.il  pi>lic\.  solves  the 
resulting  optimulitv  equation  lor  hil..ri  and  g-  and 
searches  for  stale-action  assignment  alternatives  lending  to 
a  smaller  cost  under  the  same  In  l..  r )  I  he  improved 
policy  is  set  for  the  new  iteration  until  no  improvement  can 
be  found.  The  advantage  of  this  algorithm  compared  to  the 
linear  programming  approach  is  that  it  searches  onlv  for  a 
deterministic  optimal  policy  liquation  (9)  may  he  used  to 
derive  some  properties  of  the  optimal  policy,  in  order  to 
reduce  the  amount  of  searching  and  the  computational 
complexity  of  the  policy  improvement  algorithm. 

Optimality  ami  the  Movable  Boundary:  The  optimal 
policy  discussed  above  can  be  represented  as  a  function 
/( r ).  where  l(r)  is  the  minimum  /.  for  which  t  L.  r)  is  a 
blocking  state.  In  Fig.  2  the  transition  diagram  for  the 
birth-death  line-switching  traffic  case  is  given.  The  dwell 
time  Tr  (the  number  of  transitions  the  process  stays  in 
state  r)  will  have  a  mean  value 

Ft  Tr )  = - ! - > - ! - »  | 

pbr  +  A ,/>/»(  L<  t(r))  iiMh  +  \tb 

(see  [28]).  It  is.  therefore,  reasonable  to  assume  that  the 
packet  queue  size  is  a  much  faster  changing  process  and 
hence  reaches  equilibrium  within  Tr.  The  factor  bv  which 
equilibrium  is  reached  independently  of  the  initial  state 
depends  on  the  eigenvalues  of  the  one  step  transition 
matrix  for  the  queue  state  equation.  It  is  shown  in  (22|  that 
the  process  “shrinks'’  to  its  equilibrium  by  a  factor  equal  to 
the  product  of  the  absolute  values  of  these  eigenvalues,  the 
so-called  shrinkage  factor.  Typical  values  of  the  shrinkage 
factor  for  this  particular  queue  model  range  from  10  to 
KT60. 

Under  the  above  reasoning,  the  expected  queue  length 
will  depend  only  on  the  stationary  probabilities  P,(ry. 

v 

£„(£)  =  2  E(L\r)PJr). 

r  -I) 

with  £(  L/r)  independent  of  rr. 

The  Markov  decision  process  that  minimizes  the  average 
cost  (8)  under  these  conditions  is  thus  defined  by  the  state 
rr  decisions  A  ■=  0.  I.  and  transitions  as  in  Fig.  2.  The 
optimal  policy  tr  will  again  be  deterministic;  i.c..  the  set  of 
states  is  partitioned  into  two  subsets  through  a  boundary 
A/':  if  r  <  A/',  a  new  setup  call  is  served,  if  r  ST .  a  new 
call  is  blocked.  This  is  identical  to  the  movable  boundarv 
scheme  described  in  the  introduction.  ST  will  depend  on 
the  value  of  u.  the  weight  factor  Note  that  not  all  movable 
boundary  schemes  may  correspond  to  an  n-  optimal  scheme 

B.  The  Exact  Model 

We  now-  proceed  to  the  exact  min'd  analvsts  under  the 
second  optim.tlitv  criterion.  Since  the  number  of  arrivals  or 
departures  is  not  restricted  to  one.  the  results  of  Section 
IV-A  under  the  birth  death  assumption  arc  not  applicable. 
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l  sing  the  state  notation  of  Section  ll-C  and  (5)  for  VI.  in 
(X).  we  formulate  the  finite  slate  .Markov  decision  process 
with  transitions  <  I )  (4)  and  average  expected  cost  minimi¬ 
zation  as 

+  5')|  --v.  («») 

From  (2)  (derived  from  the  memoryless  property  of  the 
exponential  distribution),  we  may  easily  show  that 

-  2 /*(  V,  |  S7  )/>„(£,)  =«-  ^EJS,). 

v, 

Assuming  irreducibility  with  rr  belonging  to  an  ineducihil- 
itv  class  { -s- )  w  .  the  process  achieves  equilibrium  and  £_i  r ) 
=  e  uhEmt  S )  or 

£.(S)  =  *■"*£.(')-  (II) 

Redefining  the  weight  a  —  ( a / )(el,h  -  I)  and 
g  —  g  —  a  we  have,  front  (10)  and  (II).  as  the  equivalent 
cost  function  to  be  minimized: 

g  -  min  {£„(£)-  aET(r )} .  a  >  0.  (12) 

*  V.  I  "  J  » 

liquations  (l)-(4)  and  the  cost  function  1 12)  guarantee  the 
existence  and  deterministic  nature  of  the  optimal  policies 
within  the  irreducibility  classes  {rr  j„  .  These  policies  assign 
the  number  of  line-switched  slots  in  a  frame  S  such  that 
rs.S£  min  ( r  -t-  e.  AT),  based  on  the  state  X  =  (  L.  r.  r ): 
X  -  S. 

The  average  expected  cost  in  (12)  suggests  that  the 
one-period  cost  C(  X.  S )  is 

C(X.S)  =  C(L.r)  -  L  -  nr.  (12) 

The  optimality  equations  can  be  simplified  by  reducing 
the  three-dimensional  slate  space  to  a  two-dimensional 
i>ne.  After  some  algebra,  it  is  concluded  that  the  optimality 
equations  are  defined  over  the  set  of  bounded  functions 
/< '<  /..  r  )  satisfying 

c  MAX 

«  r  h't  L.  r)  -  l.  -  ar  +  ^  P(  v  ) 

r  0 

I  v  V  1 

•  min  2  V  />,(#•’)/>.(  £'|  £)//  (  £’.£) 

I  '•  V-  (  '  "  '  »  I 

(14) 

with  0  r  l.  5.  Q.  0  <■  r  '■  S.  and  0  <  r  -  ('MAX  (Recall 
that  (‘MAX  is  i lie  maximum  number  of  calls  per  frame' m 
the  truncated  Poisson  arrival  piivcss.)  Probabilities  /*(  i-  ). 
/’,( r  ),  /’(r  ).  /M/.  J/. )  are  given  bv  (I).  (2l.  and  (2). 
rcspectivelv  lhe  pi'licv  improvement  algorithm  ileluied  b\ 

(  14)  billow  a. 
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Let  ll(S.  /. )  denote  the  function  to  he  minimized  in 
(14): 

c'  ' 

fHS.L)  =  £  2  />  )/•</.•  | /.)/;  < /.\a).  (15) 

/  n  >  n 

Within  the  irreducibility  class  (cjw  .  we  denote  a  determin¬ 
istic  policy  by  the  state-action  mapping  .1  w  ( /..  r.  »•)  --  S. 

The  algorithm  is  then  formulated  as  follows:  for  every 
class  of  policies  (ir)„ .  X,  p  <  M'  <  A/. 

Step  I:  Initialize  to  the  movable  boundary  policy: 
Am.(L.  r.  v)  -  min(r  +  v.  A/'). 

Step  2:  Solve  the  linear  system  for  g  and  h( L.  r). 

g  +  h‘(  L.  r)  =  L  -  ar 

CMAX  V  v 

+  2  /»(*')  2  2  PSr')Pt[L‘\L)h(L\r'). 

r-tl  /.'  -0  r  0 

V(L.r). 

with  S  =  Am(L.  r.  n).  /T(0.0)  =  0. 

Step  J:  Compute  H{S.  L)  of  (15)  for  0  S  S  S  A/'.  0  < 
L<Q. 

Step  4:  For  every  ( L.r.v ).  v  >  I.  r<M\  find  the 
integer  S  in  the  interval  r  s  S  <  min  [r  +  v.  M'}  which 
minimizes  H(S.  L ).  Denote  this  S  bv  ,4'w  (  L.  r.  v). 

Step  5:  If  A\r(L.  r.  v)  *  Axr{L.  r.  v)  for  some  states 
(L.  r.  *-).  set  Au{  L.r.v)  —  A\,(  L.r.v )  and  go  to  2.  If 
not.  stop.  Select  the  policy  A  u{  L.r.v )  resulting  in  the 
minimum  g  for  all  AT.  X,/p  <  AT  <  \1. 

The  optimal  policy,  in  this  general  case,  does  not  neces¬ 
sarily  have  a  movable  boundary  format  as  it  did  for  the 
birth-death  approximation.  It  has  been  found  in  all  the 
examples  we  have  worked  out  that  H(S.  L)  defined  by  ( 15) 
is  a  convex  function  of  S  for  all  /..  0  <  L  S  Q.  achieving  a 
minimum  Smn{  L )  within  0<5<  M'  (see  Fig.  3).  Then  the 
optimal  policy  will  assign  to  every  state  (L.  r.v)  the  action 
S  with  S  the  closest  integer  to  5,nin(Z.)  satisfying  the 
condition  r  <  S  £  min  [r  +  v.  A/'},  or 

(Smio(L).  if  rSSmJL)Sr  +  v. 

(  L.  r ,  v)  —  S  =  |  r.  ,ir>SmJL). 

/+*'•  if  r+v<S„,JL). 

The  seemingly  complex  policy  thus  reduces  for  convex 
H(S.  L)  to  a  simple  one  that  depends  on  the  packet  queue 
size  L.  The  other  components  of  the  state  vector  X  = 
{L.r.v)  are  taken  into  account  in  simple  fixed  point 
comparisons  and  additions.  The  multiplexer  needs  to  main¬ 
tain  in  its  Uveal  storage  only  the  table  SIU  J  L )  of  size 
0+1. 

We  have  not  been  able  to  prove  formally  the  convexity 
of  H{S.  /. ).  However,  the  intuitive  understanding  of  the 
optimal  policy  formal  and  our  computational  experience 
suggest  that,  for  all  practical  purposes,  the  statement  above 
is  valid 
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V.  Numiricai  Ri.sui.is 

A.  Birth -Death  Approximation 

Two  numerical  examples  were  studied  under  this  ap¬ 
proximation.  The  linear  programming  and  policy  improve¬ 
ment  algorithms  were  implemented  on  a  computer.  The 
policy  improvement  algorithm  led  to  movable  boundary 
schemes  as  expected,  within  at  most  three  iterations.  Linear 
programming  was  applied  only  to  the  first  example  (con¬ 
sisting  of  12b  variables  and  66  constraints)  since  the  size  of 
the  second  example  (386  variables  and  147  constraints)  did 
not  permit  convergence  of  the  modified  simplex  method 
used.  The  two  examples  follow. 

Case  I:  The  following  parameters  were  used  in  this  case: 

X,  =  2.0  messages/s.  I/p  =  I  s.  p,  =  X,/p  =  2. 

X:  =  2000  packets/s.  p,  =  \,/>  =  2. 

h  -  0.001  s.  M  =  6  slots/frame.  Q  -  8  packets. 

In  Fig.  4  we  have  plotted  the  resultant  loss  probability 
versus  the  average  queue  length.  Points  resulting  from 
{£(  L  \  +  aPl }  minimizations  (small  circles)  belong  to  the 
optimal  curve  (min  FAD.  Pis  A'}  extrapolated  front  the 
linear  programming  results  (black  circles).  The  former 
points  are  achieved  through  movable  boundary  policies 
( Af  =  3.4.6).  whereas  the  latter  correspond  to  nonde- 
terminisiic  schemes.  We  include  the  movable  boundar> 
policy  with  M'  =  5.  which  does  not  belong  to  the  optimal 
curve.  It  is.  however,  very  close  to  that  curve,  offering 
excellent  (suboptimal)  performance.  In  order  to  check  the 
validity  of  the  reasoning  in  Section  IV-A  concerning  the 
fast  evolution  of  the  packet  process  compared  to  the  line- 
switching  slot  number  variations,  the  shrinkage  factor  for 
transition  matrix  /*(  /.  |  /.')  was  evaluated.  It  was  found  to 
range  from  0.26  *  10  -l'  for  S  =  I  up  to  0.22  ■  10  i:  lor 
S  -  6.  The  values  of  this  factor  justify  the  fast  evolution 
assumption.  As  a  further  check,  the  stationary  conditional 
expectations  Ft  I.  .V)  were  used  to  compute  £"(  L)  and  PI 
for  the  movable  boundary  schemes  with  A/'  r  3.4. 5.6. 
The  steady-state  probabilities  PaS)  were  assumed  to  be 
given  by  the  Frlang-fl  probabilities.  A  comparison  with  the 
polio  improvement  exact  results  showed  that  the  two 
methods  agreed  within  two  decimal  digits 

Ci/ve  2  We  lepeated  the  calculations  above  (except  lor 
the  linear  programming  approach)  for  a  larger  stale-space 
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example: 


A,  =4.0  messages /s. 

I/M  =  1  s. 

Pi  =  A,/>  =  4. 

A  2  =  4000  packets/s. 

II 

>" 

la 

<> 

II 

b  =  0.001  s. 

M  =  10. 

Q-  12  packets. 

Results  are  plotted  in  Fig.  5.  Again,  all  movable  boundary 
policies  either  belong  to  the  optimal  curve  or  are  quite 
close  to  it.  The  validity  of  the  fast  evolution  assumption 
was  checked  up  to  two  decimal  digits. 

As  a  conclusion,  under  the  birth-death  assumption  the 
movable  boundary  schemes,  which  turn  out  to  be  quite 
dense  in  the  region  of  interest  (  PI  <  0.05),  offer  optimal  or 
near-optimal  performance. 

B  The  Exact  Mi  Kiel 

The  policy  improvement  algorithm  of  Section  IV-B  was 
implemented  for  the  general  case  (the  number  of  arrivals 
and  departures  are  permitted  to  have  any  salue).  The 
algorithm  is  used  to  determine  the  limitations  of  the 
birth-  death  assumption  and  to  derive  the  properties  of  the 
optimal  policy. 

I)  The  Validity  of  the  Birth -Death  Assumption  We  ap¬ 
plied  the  algorithm  to  the  example  of  case  I  in  Section  V-  A 
with  the  parameter  n  =  3.  leading  to  an  optimal  boundary 
scheme  with  M'  -  4.  The  exact  optimization  was  carried 
out  by  varying  the  frame  length  h  from  n.OOl  to  0.1  It 
turns  out  that  the  general  policy  improvement  algorithm 
leads  to  the  same  optimal  schemes  t movable  boundary  with 
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M'  —  4)  and  performance  measures  as  the  simplified  one 
under  the  birth-death  assumption,  with  values  of  h  less 
than  0.01.  For  larger  frame  durations  (lower  speed  trunks) 
the  optimal  deterministic  schemes  are  different  from  the 
movable  boundary  ones,  having  the  table  look-up  form 
described  in  Section  IV-B. 

2)  Policy  Improvement.  Case  J.  The  following  parame¬ 
ters  were  selected  for  the  application  of  the  general  policy 
improvement  algorithm: 

A,  =  0.6  messages/ s.  l/>  =  3.33  s.  P,  =  2. 

A;  =  2.0  packets/s.  p:  =  2. 

b  =  1  s.  M  =  6  slots/frame.  Q  -  8  packets. 

This  case  has  the  average  message  length  three  times  the 
frame  size,  hence  the  birth-death  assumption  is  not  valid. 
Optimal  deterministic  policies  were  computed  for  different 
values  of  o.  The  corresponding  average  queue  length  £'(  /. ) 
and  probability  of  loss  PI  have  been  plotted  in  Fig.  6.  For 
comparison,  movable  boundary  schemes  were  evaluated  as 
well  (small  triangles  in  the  figure).  As  expected.  /.'( /.  i  is 
increasing,  and  PI  is  decreasing  with  the  weight  «.  L  nlike 
the  birth- death  case,  the  optimal  points  found  by  varvmg 
a  form  a  rather  dense  or  closely  spaced  set.  Hence  there  is 
no  need  for  the  complicated  linear  program  to  determine 
the  optimal  curse.  For  a  given  level  of  loss  probability,  a 
deterministic  scheme  or.  equivalently,  a  weight  «t  can  be 
found  minimizing  the  average  queue  length.  This  flexibility 
in  adjusting  the  multiplexer  performance  to  an  optimal 
combination  of  line-switching  probability  ot  loss  and 
packet-switching  time  delay  is  the  main  advantage  of  the 
method. 

The  optimal  policies  were  found  to  have  the  form  de¬ 
scribed  in  Section  IV.-H.  t.c..  foi  everv  /..  the  number  ol 
packets  in  queue,  a  number  Smjl.)  exists  such  that  the 
optimal  policy  assigns  actions  0  •  .S'  •  M  to  state  t  r.  r) 
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as  follows: 

SmJL).  if  r<SmjL)<r  +  *. 

S=  r.  if  SmjL)<r, 

/  +  *.  if  r  +  »-  <  Smm(  Z. ). 

As  previously,  r  is  the  number  of  line-switched  session* 
requiring  continuation  of  service,  r  represents  the  new 
setup  line-switched  calls  per  frame,  and  5  is  the  number  of 
slots  allocated  to  the  line-switched  traffic.  In  Table  I.  we 
show  Smm(  L )  as  found  for  the  values  of  a  used  above.  As 
expected.  Smn(L)  is.  in  general,  increasing  with  a  (i.e.. 
more  emphasis  is  given  to  the  probability  of  loss)  and 
decreasing  with  L.  Note  that  in  the  case  of  a  >  3.25  and 
L  >  5.  this  pattern  is  violated  (see  lower  right  part  of  Table 
1).  This  is  mainly  due  to  the  finite  queue  length  assump¬ 
tion.  which  imposes  boundary  conditions  on  the  optimiza¬ 
tion  problem.  Note  that  the  packet  queue  overflow  proba¬ 
bilities  did  not  exceed  5  percent  for  the  selected  queue  size 
0  =  8. 

Comparing  with  the  movable  boundary  policy,  we  note 
that  the  improvement  in  the  expected  queue  length  vari¬ 
ance  provided  by  the  optimal  policy  is  not  significant  over 
the  movable  boundary  scheme  for  comparable  values  of 
loss  probability.  (Note  that  the  expected  queue  length 
coordinates  in  Fig.  6  are  greatly  expanded.) 

In  order  to  further  illustrate  the  comparative  perfor¬ 
mance  of  the  optimal  and  the  movable  boundary  schemes, 
we  evaluated  their  performance  under  variations  of  the 
input  line-switched  traffic  rate  A,.  We  constrained  ihe 
line-switching  loss  probability  PI  to  a  maximum  acceptable 
level  of  2.8  percent  arid  computed  the  corresponding  per¬ 
formance  of  the  optimal  and  movable  boundarv  schemes. 
Every  optimal  policy  search  involved  several  runs  lor  the 
determination  of  the  weight  providing  a  loss  probability 
closest  to  the  2.8-pcrcent  figure.  For  comparison,  the  per¬ 
formance  of  the  fixed  boundary  scheme  (see  |20|)  was 
evaluated  as  well.  This  scheme  allocates  two  predetermined 
portions  of  the  frame  to  the  two  traffic  categories  without 
permitting  the  packet  traffic  to  occupy  any  idle  slots  ir.  the 
line-switched  portion. 

We  further  assumed  that  signaling  packets,  necessarv  for 
the  control  of  the  line-switched  sessions  (path  setup,  map¬ 
ping  of  line-switched  slots,  termination  of  calls,  etc.)  are 
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transmuted  as  ordmarv  packets;  thus  tliev  mtroi!iu.e  over¬ 
head  in  the  packet  traffic  We  approximated  this  overhead 
using  the  method  described  in  ( 1 1.  Spent ic.illv.  sse  assumed 
tlui  every  line-sw itched  session  introduces  a  packet  over¬ 
head  equal  to  A'<  A,  p)  with  K  a  parameter  depending  on 
the  signaling  protocol.  The  packet  input  rate  is  then  given 
by  A,  =  A’,  +  A(A,  /i).  with  \\  the  input  rate  of  the  data 
packets.  We  did  not  explicitly  consider  this  overhead  m  the 
example  of  case  3.  since  the  values  of  A,  and  A,  were 
assumed  fixed.  Here,  the  dependence  of  A,  on  variations  of 
A,  must  be  measured  since  we  let  A,  vary.  We  chose 
K  =  0.3  for  the  particular  calculations,  a  value  used  in  |l]. 

In  Fig.  7  we  demonstrate  the  performance  of  these 
schemes  (optimal,  movable,  and  fixed  boundarv)  with  the 
line-switched  traffic  rate  A,  varying.  It  can  be  seen  that  the 
movable  boundary  schemes  result  in  almost  optimal  packet 
queue  length  Fig.  7(a)  with  probability  of  loss  adjusted 
below  2.8  percent.  In  Fig.  7(b)  the  flexibility  of  the  optimal 
scheme  in  reaching  a  given  level  of  loss  probability  is 
demonstrated.  The  packet  input  rate  A;  variation  with  A, 
due  to  the  signaling  overhead  is  shown  in  Fig.  7(c).  Tins 
variation  introduces  the  early  queue  overflow  of  the  fixed 
boundary  scheme  in  Fig.  7(a). 

Similar  results  are  found  with  the  variation  of  the  packet 
input  rate  A,,  while  keeping  the  line-switched  traffic  rate 
A,  fixed.  A,  =0.6  messages /s  was  chosen  as  the  fixed 
parameter  in  the  calculations  here.  These  results  appear  in 
Fig.  8. 

Finally,  a  sensitivity  analysis  for  small  variations  of  A, 
around  the  reference  point  A,  =  0.6  messages  s  and  A ,  -  2 
packets  s  of  case  3  showed  no  significant  difference  be¬ 
tween  the  optimal  policy  with  a  =  2  (see  Fig.  6)  and  the 
movable  boundary  scheme  with  M'  =  4.  These  results  are 
plotted  in  Fig.  9.  Note  that  the  two  sets  of  curves  are  both 
approximately  linear  about  the  initial  operating  point  and 
track  one  another  rather  closely  over  the  range  of  variation 
of  A  |  show  n. 

VI.  Conclusion 

The  problem  of  dynamically  allocating  the  bandwidth  of 
a  trunk  to  both  line-switched  and  packet-switched  digital 
traffic  has  been  formulated  in  this  paper  a>  a  Markov 
decision  process.  Line  switching  was  modeled  as  a  time 
division  multiplexing  loss  scheme  using  a  varying  portion 
of  a  fixed  time  frame  Packet-switching  traffic  is  served  by 
the  remaining  portion  of  the  frame  and  requires  queueing 
at  the  multiplexer-concentrator. 

Two  different  cost  criteria  were  examined  involving 
probability  of  loss  for  line-switching  and  average  queueing 
delay  for  packets.  The  corresponding  optimization  prob¬ 
lems  were  presented  under  reasonable  simplifying  assump¬ 
tions. 

For  the  birth  death  approximate  model,  valid  for  high¬ 
speed  trunks  serving  sufficiently  lengths  linc-sw  netted 
sums  (thus  requiring  main  frames  lor  liansmisMon).  the 
movable  boundary  scheme  was  found  to  provide  optimal 
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points  in  the  loss  probability  average  packet  delay  plane. 
The  optimal  points,  falling  on  the  curve  connecting  the 
movable  boundary  points,  may  be  achieved  bv  more  com¬ 
plicated  and  even  nondeterministic  schemes.  Those  are. 
however,  difficult  to  define  and  to  implement. 

For  larger  values  of  the  frame  duration  (or  shorter 
time-switched  sessions!,  the  birth  death  assumption  is  not 
valid  and  optimal  deterministic  schemes  found  are  differ¬ 
ent  from  the  nun  able  boundary  ones.  An  optimal  tlcicnnm- 
mn  policy  can  be  defined  in  older  to  minimise  the  packet 
delay  with  the  loss  probability  close  enough  to  a  given 
value.  The  structure  of  such  policies  is  simple  and  mas  he 
easily  implemented  for  an  intelligent  multiplexer  with  the 


capability  of  table  look-up.  The  length  of  these  ubles 
equals  the  packet  queue  length. 

It  is.  however,  possible  to  achieve  acceptable  levels  of 
loss  probability  and  near-optimal  packet  delay  with  nun- 
able  boundary  schemes.  We  therefore  suggest  the  use  of  a 
movable  boundary  technique  for  those  integrated  systems 
requiring  a  fixed  frame.  It  offers  excellent  performance  and 
simplicity  of  implementation.  We  have  thus  shown  that  the 
movable  boundary  technique,  introduced  some  years  ago 
tn  an  iul  hoc  manner  as  a  simple  scheme  for  improving  the 
capacity  utilization  of  the  lived  boundary  procedure,  does 
in  fact  display  near-optimum  properties  in  the  sense  dis¬ 
cussed  in  this  paper. 
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Appendix 

Irreducibility  Condition 

In  order  to  proceed  in  determining  the  optimal  policy,  we  need 
to  specify  whether  the  process  exhibits  the  trrvJuahilm  condition. 
This  condition  (22)  states  that  under  any  determinism  policy 
(assigning  an  action  to  a  state  with  probability  one  regardless  of 
the  past  history  of  the  process),  the  resulting  discrete-lime  Markov 
process  consists  of  one  class  of  communicating  slates.  In  our 
case,  there  arc  policies  which  do  not  allow  the  number  of  line- 
switched  (LS)  sessions  per  frame  to  exceed  a  given  number 
M'  <  M  (For  example,  the  simple  movable  boundary  policy 
described  in  Section  I  docs  not  allow  the  LS  traffic  to  occupy  the 
whole  frame.)  Under  these  policies,  states  X. . ,  with  M'  <  r, . ,  < 
M  may  not  be  reached  from  state  X,  with  0  <  r,  <  M  In  order  to 
preserve  the  irreducibility  condition,  we  therefore  need  to  define 
the  following  classes  of  policies  (referred  to  hereafter  as  "irreduci¬ 
bility  classes”)  based  on  the  different  barriers  imposed  on  the 
line-switched  slot  numbers  per  frame:  the  set  ( w  j  of  all  policies  is 
divided  into  the  mutually  exclusive  subsets  or  irreducibility  classes 
(w)„  ,  0  <  A/'  <  M .  such  that  anv  policy  belonging  to  the  class 
(ir}„  may  assign  actions  achieving  but  never  exceeding  Af 
(S,  <  M  ). 

If  we  limit  the  search  for  an  optimal  policy  within  a  particular 
class  {*)„  .  the  state  action  space  is  reduced  to  0  *  r;  r.  M'  and 
0  T  5,  5  V  Obviously,  any  deterministic  policy  belonging  to 
(tt)„  results  in  an  irreducible  Markov  chain:  (I)  permits  i-  to 
take  any  value  up  to  CMAX;  (2)  lets  r,  take  anv  value  up  to  .V, 
with  .S,  <  M'.  and  (3)  from  M/C/S  queue  analysis  defines  an 
irreducible  process  with  0  •  /.,  r-  Q  for  an-  .S’,  pattern  as  long  as 
the  frame  format  can  accommodate  the  average  traffic  require¬ 
ment  The  latter  condition  leads  to  lower  bounds  for  A f  and  M 
M'  must  be  greater  than  the  l.S  traffic  intensity  and  U  greater 
than  the  total  traffic  intensity  (line  switching  plus  pjcket  switch¬ 
ing)  Thus  A,  p-  'f  •  If  and  A,  u  -  A -/»•  M  This  reason¬ 
ing  suggests  that  the  irreducibility  condition  holds  within  an 
irreducibility  class  of  policies  and  for  the  accordingly  reduced 
state  space 


References 

|l)  F  Closs.  “Message  delays,  trunk  utiliranon  in  line  switching  and 
message  sw  itching  networks.”  in  A  UPS  Cimf  Pnn  .  1972.  pp 
524-529 

|2)  K  Kummerle  and  H  Rudin.  “Packet  and  circuit  switching:  Cost 
performance  boundaries."  Compui  \eruun l.t,  vol  2.  pp  3-17.  Feh 
I97X. 

(3|  H  Misaharj  er  a/..  “A  comparative  evaluation  of  switching  meth¬ 
ods  in  computer  communication  networks."  in  ICC  ■ Pn*..  pp 
n-n-h-IO 

|4|  D  Maiw  aid  ei ul .  "Integrated  commumcaiion  system  performance." 
in  ICC-’.i  Pro,  .  pp  24-13-24-21 

|5|  K  kummerle.  “Multiplexer  performance  for  integrated  line-  and 
packet-switched  traffic."  in  ICCC  P'<*  I9'4.  pp  SW-515 

|h|  P  Zafiropoulo.  "Flexible  multiplexing  for  networks  supporung 
line-switched  and  packcl-sw itched  data  traffic.”  in  /C'CV  P"h  . 
1974.  pp  517-523 

|7]  E  Port  ct  ul .  "A  network  architecture  for  the  integration  of  circuit 
and  packet  switching."  in  ICCC  Pn*  .  1976.  pp  >05-514 

|X]  P  Zafiropoulo  cr  ul..  “Extension  of  a  circuit- switched  user  network 
interface  to  packet  switching."  in  ICCC  P"h  .  1976.  pp  515-  522 

(9|  H  Rudin.  "Studies  of  the  imcgraiion  of  circuit  and  packet  switch¬ 
ing."  in  ICC-  'S  Prn,  .  pp  2x7  2  I  -  20  2  ' 

1 10)  ti  Cosiello  and  P  Vena.  “Integration  of  circuit  packet  swiicluna 
bv  a  Sl.NET  (stoned  envelope  network)  concept.  in  x  /( '■'<  /’r.s  . 
pp  42-12-42-17 

(ll|  "SF.NET  DA.V  Studv."  (iTF.  Final  Rep  Contract  DOA  KKl-’'5-C'- 
()07|.  1 97p 

(I2|  M  Ross  or  ul  .  "Design  approaches  and  performance  criteria  for 
integrated  voice  data  -witching."  /'os  ti  l  l  vol  h 5.  Scpi  I9'7 
(l.'l  (>  Costello «-r ul ..  “Sxstem  dc-icn  imphs.itions of  packctired  voice." 

in  ICC-"  Poh  .  pp  3*  >-49  ts  1.5 ; 

|I4|  "Economic  analvsis  of  integrated  DOD  voice  and  d.itj  network-.' 
Network  Analv-i-  C  orp  .  (ireai  Neck.  NT.  Final  Rep  Contract 
DAHC-15-7 VC-01  ts.  Scpi  I97h 

|I5|  I  tinman  and  It  I  rjnk.  'Economic  analv-i- of  micgralcd  voice 
and  dal. i  nelwork-  A  ca-e  -ludv.”  Puk  //  I  I .  vol  Not  I'P’s 
1 1 1-1  U.l'iht  \ctHink  .Sum  /ime  and  hull'll  Viorni.  Sv-lcm  IV 
-enption  Manual.  Tr.in  v  orpor.ilion.  I  I  Scgtindo.  I  Y  V(  is  I9"i- 
1 1 7 1  J  V.mdci  Mex.  The  architecture  of  .i  transparent  intelligent  net 
woik  in  X  /(  '<i  /'os  pp  7  2  1  "  ;  - 
J  IN)  It  M.igl.m-  and  M  5icliw.ui/.  IVrloiinaiice  c- ..lUi.ni.'iv  of  a  c.m 
jhlc  frame  itniltiptexei  lot  integrated  -wn-hed  neiwo-k-  "  to  h, 
puhli-hed  III  II  I  l  llil*l\  t  nHiMllir:  / »-(  ri»l-*/ 

|I9|  It  Misahjr. i  and  T  I Ij—t.jw.1.  Inleci.tied -w ik hing  with  v .irial-U- 


IFtf  TXANSAC  I  IONS  ON  INHMMaIION  I  III 'Hit  VIII  Il'K.  NO  2  M.sin  II 


m 


frame  and  packet."  in  li  t'-'.V  /V.«  .  pp  Jo  1  I  Ju  i  5 
j 20)  M  l:nhcf  and  T  Harris.  “A  nuclei  for  cv ahiation  ol  ilu  nt-ilm 
ntancc  of  integrated  citcii'l-  and  packet- si*  ,itlu-d  tiiuliiplr<  >kih 
lure."  f/.'/.'f  Trum  (  tmmttm.  f it  hiuil  .  sol  (.  OM-24.  pp  ho  Jii\ 
Feh.  If 76 

(2 1 J  C.  )  WeinMem  cr  ul .  "Data  traffic  performance  jmjIs ms  of  an 
integrated  eireuit-  and  packet-switched  multiples  structure  "  II  I  I 
Trum  Cimmwn  Til  fulfil .  sol  COM-2X.  pp  X7J-X7X,  June  Ifxtl 
1 22)  S.  Rosa,  Apphctl  PnAkihtluv  \Untvh  w nil  Opttnu;urnm  Applnutumy 
San  Francisco.  C A:  llolden-Day.  If 70 
(23)  L.  Kletnrock.  Qurueutx  .Vt  oenu.  vol  I.  New  York:  Wiley.  If75. 


1 J  *  I  C  J)eiiii.i*i.  fti.it,  \tuit '  Slutiin  Ifntuim  frurno  Nets  Vaik 
Aiadeniie.  Iy7n 

1 2s!  licrisck  is  |l .  Ii\iu:itu,  I'ri  ittuumtinv  ittitl  Stimhuitu  (  I.tilr. C  Seu 
Volk  As.nl.'tllis.  l‘Pn 

|26|  I  (Olniun  (r  u/  .  Issues  in  integrated  nets! or k  design."  in  /((  ■T' 

/•i<«  .  pp  >x  i  ip  -  Ik  i-ai 

iJ'l  l  R'lhm.  "<iroup  random. aceess  disciplines  for  muln  access  hioad 
cast  cliaiiitcl'."  I  Iff.  Trum  htfutn:  Unify.  vol  IT-24.  Sept  lf7X. 
pp  57X-Jf2 

(2X)  i.  Keilson.  “Markov  chain  models,  rartlv  and  cxponcniialilv."  Univ. 
of  California.  Berkeley.  Tech  Rep..  If74 


Satellite  Access  Methods  for  a  General  Purpose  Packet 
Network  with  a  Time  Critical  Traffic  Component 

Pasil  S.  Mag  laris  and  Tsvi  Lissack 
Network  Analysis  Corporation 
130  Steamboat  Road 
Great  Neck,  New  York  11024 

ABSTRACT 

Various  satellite  access  alternatives  are  studied  in  terms  of  accom  mods  tine 
packetized  traffic  mix  including  a  tight  delay  constrainted  component.  The  time  critical 
traffic  delay  due  to  contention  and  frame  latency  is  assumed  to  be,  at  most,  50  msec  with 
95^  confidence.  Two  major  access  categories  are  studied.  Under  the  first,  t^e  time 
critical  traffic  is  served  in  a  dedicated  subchannel,  either  via  slotted  ALOHA  random 
access,  or  via  TDM  slots.  As  a  result,  the  non-time  critical  bulk  traffic  is  decoupled  from 
the  stringent  delay  constraint  and  may  use  more  efficient  demand  assignment  technique. 
Under  the  second,  a  fixed  TDM  A  frame  is  partitioned  into  fixed  oortions.  Each  transmitting 
earth  station  fills  its  dedicated  portion,  giving  priority  to  the  time  critical  traffic.  The 
partition  should  be  able  to  accommodate  the  anticipated  time  critical  traffic  within  the 
given  specifications. 

Analysis  of  the  above  schemes  is  being  carried  using  slotted  ALOFA,  M/D/1,  anc 
M/D/N  oueueing  models.  It  is  shown  that  the  slotted  ALOHA  alternative  provides  a  very 
poor  candidate.  Among  dedicated  TDM  subchannels  and  Driority  TDMA  alternatives,  the 
latter  is  shown  to  provide  a  simple  and  robust  scheme,  especially  suitable  in  case  of 
relatively  fixed  traffic  proportionality  among  the  accessing  earth  stations. 


I.  INTRODUCTION 


Satellite  channels  have  been  widely  recognized  as  an  attractive  alternative  to  long- 
haul  voice  and  data  communications.  Specifically,  the  development  of  high  speed  digital 
transmission  hardware  and  high  speed  burst  modems  permits  the  transmission  of  digital  data 
at  rates  above  50  Mbps.  Broadcast  geostationary  satellites  have  been  used  for  the  transfer 
of  packetized  data  among  a  population  of  users  with  appropriate  earth  station  (ES) 
equipment.  The  corresponding  protocols  for  satellite  packet  network,  e.e.,  satellite  access, 
transponder  capacity  sharing  and  error  control  have  been  developed,  based  on  the  satellite 
channel  salient  characteristics:  Broadcast  nature,  250-270  msec  one-hop  propagation  delay, 
multiple  access  (shared  use  of  the  satellite  transponder  capacity),  and  Gaussian  noise 
statistics.  Frequency-division  multiple  access  (FDMA)  techniques  have  been  traditionally 
associated  with  voice  satellite  communications.  Recently,  however,  time-division  multiple 
access  (TDMA)  has  been  suggested  as  a  more  flexible  and  efficient  technique  for  both 
digitized  voice  and  packet  data.  In  TDM  A.  the  data  from  each  earth  station  are  segmented 
into  bursts  of  transmissions  which  are  timed  and  interleaved  within  a  time  frame.  Initial 
acquisition  and  synchronization  are  achieved  via  frame  end  burst  overhead  symbols. 

Depending  on  the  flexibility  of  a  TpVA  scheme  to  adapt  itself  to  traffic  variations, 
the  following  three  major  classes  are  defined  in  {  1  ]  :  Dedicated  uplink/dedicated  downlink 
(Fixed)  TDM  A  with  fixed  data-rate  point-to-point  links  connecting  every  possible  source 
destination  pair,  dedicated  uplink/shared  downlink  having  a  fixed  length  burst  assignment 
per  source,  dynamically  filled  with  various  destination  data,  and  shared  uplink /shared 
downlink  with  the  source  burst  assignment  dynamically  varying  according  to  the  accessing 
earth  station  traffic  requirements.  From  the  above  approaches,  the  first  is  unnecessarily 
rigid  and  wasteful  for  a  packet  data  network.  The  second  and  third  provide  viable 
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alternatives  depending  on  the  traffic  burstiness  and  delay/throuehput  reauirements.  For  a 
bursty  interactive  packet  environment,  demand  assignment  techniques  have  been  developed 
from  the  early  satellite  communication  phases.  These  techniques,  which  are  associated  with 
the  shared  uplink/shared  downlink  class,  ranee  from  random  access  pure  and  slotted  ALOHA 
[2],  [3]  to  implicit  [4],  and  explicit  reservation  [  5],  [6]  techniques.  The  latter  schemes 
introduce  centralized  or  distributed  control  mechanisms  in  order  to  resolve  scheduling 
conflicts  among  the  accessing  units,  thus  increasing  the  system  throughput  at  the  cost  of 
additional  delay  and  implementation  complexity. 

A  natural  extension  to  the  application  of  packet  satellite  network  for  interactive 
traffic  has  been  the  us*1  of  the  satellite  large  bandwidth  resources  and  broadcast  nature  to 
support  multiple  puroose  integrated  digital  communications.  These  networks  should  be  a^le 
to  provide  timely  and  reliable  data  communications  to  a  variety  of  users  end  application^.  It 
is,  therefore,  necessary  to  devise  priority  access  schemes  in  order  to  satisfy  performance 
requirements  for  traffic  eateeories  ranging  from  interactive  Dacket  delivery,  requiring  les< 
than  half  a  second  end-to-en^  delay,  to  message-switching-  electronic  document  distribution, 
overnight  large  file  transfer,  and  voice/video  communications.  The  POPA  Drotocol 
described  in  [  1  ] ,  is  a  tyrvral  scheme  aiming  at  dvnamicelly  accommodating  such  a  diversity 
of  applications. 

The  main  issue  associated  with  a  general  purpose  integrated  satellite  packet  network 
is  the  handling  of  the  time  critical  applications.  In  order  to  provide  o-'mDarable  service  to 
terrestrial  value-addin?  networks,  the  end-to-end  delay  should  be  of  the  order  of  300-3 A 0 
msec  including  access  aueueing  delay  and  latency  end  the  250-270  msec  propagation  delay. 
Obviously,  no  explicit  reservation  scheme  can  fulfill  the  above  requirement  since  it 
introduces  an  additional  one-hop  reservation  delay.  A  direct  access  scheme  should, 
therefore,  be  evaluated,  such  that  frame  latency,  queueing  and/or  retransmission  statistics 
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do  not  violate  the  given  specifications.  The  stringent  time  critical  delay  requirement  is 
expected  to  result  in  some  inefficiency  in  both  the  time  critical  and  the  non-time  critical 
(bulk)  traffic  capacity  utilization.  It  is  the  Durposc  of  this  paper  to  comparatively  evaluate 
various  feasible  access  methods  and  determine  their  design  parameters  as  a  function  of  the 
packet  rate  of  the  time  critical  traffic.  Two  general  access  categories  are  studied,  as 
shown  in  Figure  1.  Under  the  first,  the  time  critical  traffic  access  a  dedicated  subchannel 
either  via  slotted  ALOMA  or  via  time-division  multiplexed  ('rDM)  slots  dedicated  tc  every 
earth  station.  As  c  result,  the  bulk  traffic  is  decoupled  from  the  very  stringent  delay 
requirement  and  may  use  an  appropriate  dynamic  assignment  techniaue.  Under  the  second 
category,  a  TDMA  frame  is  partitioned  into  segments  or  slots,  preassigned  to  the  accessing 
earth  stations.  The  slot  sizes  depend  on  the  relative  traffic  intensities  of  the  earth  stations 
and  are  updated  periodically  with  the  period  spanning  several  frames.  Ee^-h  transmitting 
earth  station  fills  its  dedicated  slot,  giving  priority  to  the  time  critical  trafuc.  An  V/P/N 
queueing  model  is  used  to  define  the  minimum  slot  size  such  that  to  guarantee  proper 
delivery  of  the  time  critical  oackets.  For  a  description  of  this  priority  TPM  A  protocol  see 
[V]. 

The  models  used  for  the  performance  evaluation  of  the  above  strategies  assume  a 
finite  population  of  earth  stations  generating  fixed-size  time  critical  packets  with  Poisson 
statistics.  The  packet  arrival  rate  A  is  identical  for  all  earth  stations.  Note,  that  the 
Poisson  assumption  is  justified  by  the  realistic  model  of  a  large  terminal  population 
clustered  around  a  particular  earth  station.  The  tolerable  time  critical  access  delav  is 
assumed  throughout  this  paper  to  be  50  msec  within  a  95°c  confidence  interval.  This 
criterion  provides  a  more  meaningful  performance  measure  than  the  average  delev,  despite 
the  analytical  complexity  it  involves.  Whenever  analytically  inaccessible,  the  95^ 
percentile  is  estimated  from  first  and  second  moments.  A  discrete  event  simulation  is  used 


to  check  the  validity  of  the  above  estimation.  The  50  msec  access  delay  specification  is  a 
reasonable  choice  resulting  approximately  in  300-350  msec  end-to-end  delay  including 
processing  and  propagation. 


D.  DEDICATED  TIME  CRITICAL  SUBCHANNEL  STRATEGIES 


In  this  section,  we  evaluate  the  two  methods  for  handling  the  time  critical  traffic  in  a 
separate  dedicated  subchannel.  Since  these  methods  decouple  the  time  critical  and  non¬ 
time  critical  components  of  the  traffic,  their  adequacy  and  ability  to  meet  the  delav 
constraints  are  prerequisites  for  the  use  of  demand  assignment  reservation  schemes  to 
handle  the  bulk  components. 

A.  Slotted  ALOHA  Acces  Analysis 


A  slotted  ALOHA  subchannel  can  be  used  to  provide  direct  random  access  to  time 
critical  interactive  (IA)  packets.  The  250-270  msec  one-hop  propagation  time  results  in  a 
500-550  msec  delay  to  receive  an  ACK  or  detect  a  collision.  In  case  of  collisions  and 
retransmissions,  the  time  critical  end-to-end  delsy  is  violated  by  far.  Hence,  the  onlv  viable 
solutior  is  to  keep  the  collision  probability  below  5%  allowing  the  05°-  of  the  IA  packet*  to 
experience  only  the  frame  latency  assumed  less  than  50  msec.  (By  frame  latency,  we  mean 
the  delay  experienced  by  a  packet  from  its  arrival  to  the  time  it  mav  be  considered  for 
service.) 

The  slotted  ALOHA  subchannel  consists  of  slots  of  length  L  (in  Kbits),  T  sec  aDart  a* 
shown  in  Figure  2.  The  time  critical  packets  of  f’xed  length  L,  are  generated  in  the  V 
transmitting  ES's  with  Poisson  rate  X  packets/sec  and  contend  for  the  allocated  slots. 
Collisions  detected  after  500  msec  are  handled  by  retransmitting  the  collided  packets  in 
future  slots  selected  at  random  over  an  infinite  interval.  Following  the  analysis  of  the  finite 
population  model  in  [2],  we  have  thet  the  average  number  of  retransmissions  per  packet  r 
is  given  by, 
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(1) 


E 


-  1 


where,  S  =  M*X*  T  and  G  is  given  by  the  nonlinear  eouation, 


s  .  on  -%',M 


<3) 


The  probability  of  collision  for  small  values  of  E  will  be  approximately  equal  to  E, 

E  =  0  -P(O)  +  l-P(l)  +  •••  ~  Pfl) 

To  evaluate  the  performance  of  the  system,  we  must  compute  the  maximum  repetition 
interval  T  for  which  E  *  5%  under  various  arrival  rates  X.  Obviously,  more  frequent 
repetition  of  the  dedicated  slots  (small  T)  results  in  putting  aside  a  greater  portion  of  the 
channel  bandwidth  for  the  time  critical  traffic,  more  fragmentation  of  the  frame  format 
and,  hence,  deterioration  of  the  system  performance. 

F.  TDM  Subchannel  Analysis 

The  model  assumes  that  slots  of  length  L,  dedicated  to  every  accessing  ES,  are 
separated  by  T  sec  as  in  Figure  3.  The  repetition  period  should  again  be  less  or  equal  to  50 
msec  to  account  for  the  frame  latency.  The  time  critical  packets  of  length  L  arrive  with 
Poisson  rate  Xand  wait  on  a  FIFO  (First-In,  First-Outl  single  server  queue  at  the  source  F5. 
Service  of  outstanding  packets  occurs  every  T  sec  via  the  dedicated  slot.  An  M/D/1  analysis 
is  performed  to  compute  the  first  and  second  moments  of  the  access  delev.  (For  a  similar 
treatment,  see  [8].)  U'e  assume  that  the  total  delay  D,  consists  of  the  sum  of  two 
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independent  random  variables:  The  uniformly  distributed  latency  l,  0  <  l<  T  and  the  V/P/l 

2 

queueing  W.  The  mean  Pand  variance  on  are  then  given  by, 


X*  T 


2f  1  -  *T) 


+  I 
2 


•£•£**  [ 


X»  tZ 
2(1  -  XT) 


+ 


3(1  -  Xt) 


(3) 

(4) 


Let  denote  the  95  percentile  of  the  total  access  delay.  We  define  the  spread  factor  SF 
as  the  factor  which,  when  multiplied  bv  the  standard  deviation,  determines  the  95?5 
confidence  interval;  namely, 


D-95  =  P  *EF'  °D 


'5) 


Various  estimates  can  be  considered  for  the  value  of  the  SF  in  (5).  Using  Chebyshev’s 
inequality,  it  turns  out  that, 

Pr  fp  >  P  +  SF  •  a  )  =  5°'  <  1  -y  ~  -  — >  SF-4*° 

r'  ‘  (SFV 

This  is  a  loose  UDper  bound  on  SF.  Other  relaxed  ad-hoc  choices  may  be  SF  =  3,  and  SF  = 
1.9(1  corresponding  to  the  normal  distribution.  A  more  reasonable  choice  can  be  found  using 
the  following  argument:  If  T  =  50  msec,  the  probability  of  meeting  the  (P^.*^  50  msec} 
constraint  is  almost  equal  to  the  probability  of  an  arrival  in  an  empty  queue  (the  access 
delay  consists  of  latency  onlv).  This  probability  ecuals  to  X  T.  In  order  to  make  this 
probability  equal  to  5°*,,  we  select  X  =  1  pecket/sec  with  T  =  50  rrspc. 


■*  I, 

=  50  msec 

X  =  1  p/sec 
T  =  50  msec 

From  (3),  (4),  f5)  with  X  =  ]  p/sec  and  T  =  50  msec,  we  conclude  that  FF  =  1.5.  In  Table  1, 
we  present  the  95%  delay  under  X  =  i  oacket/sec  and  T  =  50  msec  for  different  SF  choices. 

Our  choice  SF  =  1.5  leads  to  the  most  realistic  value. 

The  M/D/l  95%  analysis  presented  in  this  section  is  used  to  evaluate  the  maximum  IA 
rate  which  does  not  result  in  more  than  50  msec  delay  within  95%  confidence  for  different 
values  of  T.  Again,  small  values  of  T  correspnd  to  larger  percentages  of  the  total  channel 
capacity  dedicated  to  the  time  critical  subchannel. 


r.  Evaluation 

The  analyses  oresented  in  the  previous  sections  have  been  used  to  evaluate  the 
required  dedicated  caoacitv  (in  percents  of  total  available  capacity)  as  a  function  of  the 
packet  arrival  rate  of  the  time  critical  traffic.  Two  wideband  satellite  channel  capacities 
have  been  chosen  as  reference  examples:  r  =  14  Mbps  and  C  =  20  Mbps.  The  number  of 
accessing  ES's,  V,  assumes  two  extreme  values  5  and  20.  The  ceoacity  dedicated  to  the 
time  critical  subchannel  C^r,  follows  directly  from  the  corresponding  repetition  time  T  of 
the  dedicated  slots  of  length  L  =  1  Kbit.  Thus,  for  the  slotted  ALOHA  case, 

r 

TO  I. 

c  t*  r 

For  the  dedicated  TO'!  u  ~e, 


(0) 
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The  numerical  results  are  plotted  in  Figures  4  and  5.  As  it  can  be  seen,  slotted  ALOHA 
saturates  much  faster  than  the  dedicated  TDM.  This  is  due  to  the  fact  that  the  slotted 
ALOHA  scheme  schould  be  designed  such  that  the  probability  of  contention  is  less  than  5% 
to  account  for  the  500  msec  delay  induced  with  any  collision.  The  dedicated  TDM  can  sllow 
contention  of  arriving  packets  at  the  transmitting  queue.  If,  for  example,  the  dedicated  slot 
is  repeated  every  10  msec,  a  packet  may  remain  on  queue  for  five  periods  without  violating 
the  50  msec  delay  constraint.  Therefore,  the  increase  of  the  dedicated  caDacity 
(equivalents  the  decrease  of  the  period  t)  has  a  greater  impact  on  the  dedicated  TDM  than 
on  the  slotted  ALOHA  scheme.  In  addition,  S-ALOHA  schemes  involve  more 
imolementation  complexity  and  require  flow  control  for  stability.  Ke  conclude,  therefore, 
that  dedicated  TDM  is  preferred  to  S-ALOHA  for  handling  the  time  critical  traffic  on  a 
dedicated  subchannel. 

Another  important  advantage  of  using  a  dedicated  subchannel  approach  is  that  these 
schemes  decouple  the  time  critical  from  the  bulk  traffic,  thus  permitting  a  dynamic 
assignment  scheme  for  the  letter.  Their  mein  disadvantages  are  due  to  the  protocol 
complexity  of  handling  seoarete  log;cal  links  and  the  overhead  associated  with  the  frame 


fragmentation. 


ni.  PRIORITY  TDM  A 


A.  Motivation  and  Descriotion 


This  section  deals  vnth  the  indications  of  incorporating  the  time  critical  IA  traffic 
within  the  bulk  traffic  access.  Ke  concluded,  in  the  previous  section,  that  slots  of  length  L, 
dedicated  to  every  transmitting  source  and  dispersed  in  fixed  time  intervals,  provide  an 
acceptable  mechanism  for  handling  the  time  critical  packets.  This  motivated  the  design  of 
the  simple  and  robust  priority  TDMA  scheme  presented  in  this  section.  Instead  of  the  single 
server  model  of  Section  II,  B,  we  simplv  collocate  all  slots  dedicated  to  one  source  for  a 
frame  period  less  than  or  equal  to  50  msec.  By  doing  so,  we  reduce  the  frame  fragmentation 
overhead  and  allow  the  non-time  critical  traffic  to  occupy  anv  idle  capacity  not  used  by  the 
time  critical  packets  (recall  that  the  95°o  confidence  constraint  results  in  large  percentages 
of  unused  capacitvl.  V.e  briefly  describe  its  main  featvires  (see  Figure  61; 

All  traffic  accesses  the  channel  via  a  T  sec  frame,  T  <_  50  msec.  The  frame 
length  should  be  long  enough  to  result  in  acceptable  fragmentation  without 
violating  the  50  msec  latency  constraint.  Note,  that  the  high  speed  satellite 
channel  is  the  main  reason  for  making  this  scheme  attractive.  As  an  example,  c 
50  msec  frame  carries  1  Mbits  when  using  p  20  Mbps  channel. 

A  variable  sixe  slot  is  allocated  to  every  ES.  The  minimum  allocation  ner  KS 
should  he  such  that  95a  of  the  arriving  time  critical  packets  can  hr 
accommodated  within  50  msec.  The  ES's  serve  the  time  critical  packets  with 
non-pree motive  priority  over  the  other  traffic  classes. 
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The  slot  allocation  remains  fixed  for  a  period  of  time.  Every  ES  has  full 
responsibility  to  allocate  its  data  according  to  given  priority  tebles  and  using 
some  scheduling  algorithm.  Thus,  the  scheme  can  be  classified  as  fixed 
uplink/shared  downlink  multiple  access.  It  is,  however,  possible  to  adant  the 
capacity  assignment  to  the  traffic  variations  using  a  centralized  control 
mechanism  as  in  [7].  The  central  controller  ES  can  update  the  allocations 
either  by  monitoring  the  ES's  traffic  statistics  or  after  considering  periodic 
traffic  request  reports.  Note,  that  the  synchronization  and  allocation  update 
processes  mav  span  many  frames.  In  addition,  the  complexity  involved  in 
updating  the  slot  allocation  suggests  that  this  should  not  be  performed  very 
frequently  ftvpicallv  ever'1  1.000  or  2,000  frames).  The  scheme  can,  therefore, 
adapt  to  relatively  slow  variations  of  the  traffic  mix:  this  is  the  case  of  a  large 

population  of  terminals  connected  to  an  FS.  resulting  in  quite  smooth  traffic 

with  no  rapid  peaks  due  to  averaging  effects. 

In  the  following  section,  ve  present  the  analytic  tools  needed  to  evaluate  the  minimum 
allocation  per  ES  in  order  to  guarantee  proper  time  critical  service. 

B.  Delay  Analysis 

The  95S-.  confidence  interval  delev  analysis  is  based  on  the  M/P/N  oueue  model  of  tU;e 
TDM  A  scheme.  S’miler  multiserver  queueing  models  hpve  been  reported  in  [°] ,  [in],  and 

[11].  The  m/D/v.  queue  is  an  accurate  model  for  the  time  critical  packet-service 

mechanism  of  Figure  C.  Specifically,  the  service  time  of  a  packet  is  defined  as  the  frame 
duration  T,  since  all  packets  are  considered  for  service  at  the  beginning  of  a  new  frame. 
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The  number  of  servers  N.  corresponds  to  the  number  of  time  critical  packets  fitting  in  slot  i. 
Whenever  the  allocation  0.  is  greater  than  the  minimum  N\,  the  delay  constraint  is  further 
relaxed.  We  focus,  therefore,  on  the  computation  of  the  minimum  allocation  NL.  For 
simplicity,  we  will  drop  all  subscripts  i,  denoting  ESj.  In  order  to  simplify  our  analysis,  we 
assume  that  the  frame  length  T  is  decided  such  that  the  95%  delay  D  =  50  msec 
corresponds  to  an  integer  number  of  frames: 

A  ^.<>5 

T  =  Tk  =  — — K  =  1,  2,...  (8) 

The  delay  D  encountered  by  a  packet  will  depend  on  the  number  of  packets  q  found  in  queue: 
specifically,  we  have, 

D  =  Latency  if  q  <  f> 

D  =  Latency  +  Tv  if  N  <  q  <  2N 

P  -  Latency  -  2T^.  if  2V  <  q  <  3N 

P  ~  Latency  +  iT(<.  if  N  <  q  <  (i  +  1  )N 

Furthermore,  since  the  frame  latency  equals  to  one  frame  with  100%  confidence,  we  have, 

c  <  iN  P  <  iT„ 

-  K 

It  immediately  follows  that  with  P.Q-  =  K  •  T,., 

Pr  {  P  <  D.95  }  =  Pr  {  q  <  KN  }  (91 
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We  wish  to  evaluate  A.,,  the  maximum  rate  of  the  time  critical  traffic  for  which  the  r.h.s  of 

K 

(91  is  kept  above  959t : 


•95 

TK  =  "k  ’  K  =  2’* 


Find,  A,.  =  max  {A} 


Such  that,  Pr  {P<Dflr}  =  Pr  {q  <KN>  >  95% 


The  queue  state  statistics,  q,  needed  to  evaluate  (10)  are  summarized  in  the  Appendix.  For 
K  =  1,  Pr  {q  <  N}  is  given  explicitlv  by  (A-3)  in  the  Appendix.  Thus,  the  maximum  rate  A 
can  be  easily  computed  for  various  values  of  N  using  a  simple  search  algorithm.  For  K  >  1, 
no  exact  formula  for  the  probability  Pr  {  q  <  KN}  is  available.  Instead,  we  use  an 
approximate  method  based  on  the  first  and  second  moments  of  the  queue,  state  as  evalueted 
in  the  Appendix.  Our  approximation  consists  of  the  assumption  that  the  M/D/N  oueuc  size 
distribution  with  arrival  rate  A j,-  and  frame  length  T^,  as  defined  in  (10\  will  have  for  a 
given  N,  the  same  shape  if  normalized  with  respect  to  K.  Under  the  above  assumption,  for 
every  N  we  define  the  sppead  factor  SFfN),  such  that  the  95%  queue  size  KN  of  (10)  is. 


KN  =  rK(q)  +  SF(N)  oQ  y 


Here,  E,,  (q)  and  o  ere  the  mean  and  standard  deviation  of  the  aueue  size  under  T„  and 
K  q,  a  '  n 

given  by  (A-4)  and  (A-5)  in  the  Appendix.  The  assumption  that  SPfN)  is  independent  of 
K  permits  its  analytical  evaluation  from  the  known  case  K  =  1.  The  SF(N)  estimate  can  then 
provide  confidence  intervals  for  larger  values  of  K.  Other  choices  for  the  spread  factor. 
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such  as  SF  =  4.5  (Chebvshev's  inequality),  SF  =  3,  SF  =  1.96  (normal  approximations),  may 
prove  valid  for  specific  parameter  sets,  but  are  inflexible  in  adjusting  to  every  particular 
queue  distribution.  Our  method,  which  provides  flexible  and  tieht  estimates  has  been 
checked  via  discrete  simulation  (see  the  next  section). 

The  corresponding  algorithm  is  summarized  below: 

Step  1:  For  a  given  D.gc.  and  N,  set  K  =  1,  T1  =  D.95- 

Find  =  max  (a)  ,  such  that  Pr  {  q  <  N  }  >  95%  (Eq.  A-31 


Step  2:  With  x  =  A-j  and  T  =  compute  (q)  (Eo.  A-4)  and  ,  (Eq.  A-5). 


Compute  SF(N)  = 


N  -  Ej  (o) 
°o.  1 


Sten  3: 


For  K  =  2, -3,...,  end  T  = 


•95 


Find  Xj,  =  max  {X}  ,  such  that  K NT  <  E^.  (a)  -*■  SF(N).  0  ^ 

where.  E,.  (q)  and  o„  are  riven  from  Eas.  (A-4)  and  (A-5)  with  X  =  X 
a.  is 


T  =  T 


K* 


C.  Numerical  Results  and  Evaluation 

The  analvsis  of  the  previous  section  has  been  carried  out  with  D  q-  =  50  msec  and  K  = 
1,  2,  3,  4  correspond  nr  to  frame  durations  =  50  msec,  =  25  msec,  =  16.66  msec, 
and  =  12.5  msec.  The  minimum  number  N  of  capacity  "quanta’’  (a  quantum  can 
accommodate  one  time  critical  packet  length  £.)  is  varied  from  1  to  45.  The  correspondnr 
tolerable  time  critical  packet  rates  x  K  =  1,  2,  3,  4,  have  been  computed.  A  fairly  simple 
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time  driven  simulation  program  has  been  used  to  simulate  the  1V/P/V  aueue  recursive 
equation  (see  Appendix).  Experiments  have  been  conducted  to  evaluate  the  probabilitv  F ^ 
of  violating  the  50  msec  delay.  The  simulation  was  run  over  10,000  frames  and  statistics 
have  been  taken  after  the  first  1  ,000  transient  frames. 

In  Table  2,  we  tabulate  the  analytically  obtained  X  for  N  =  1  ,...,45  and  the 
corresponding  simulation  result  P^.  In  the  same  table,  ve  show  t^e  various  SF  (spread 
factor)  choices.  'Fhese  results  are  plotted  in  Figures  7  and  8.  Specifically,  in  Figure  7,  we 
show  the  maximum  tolerable  rate.  X,,  versus  the  total  number  of  cacacitv  quanta  dedicated 
to  a  particular  ES  during  1  sec  (N  slots  in  a  T9  =  75  msec  scheme  correspond  to  2N  slots  in 
the  T^  =  50  msec  case).  The  quite  striking  observation  from  Figure  7  is  the  considerable 
improvement  obtained  under  the  T0  =  2 5  msec  frame  over  the  T^  =  50  msec  one.  Thom 
imorovement,  ranging  from  50cv  to  709^,  can  be  understood  by  observing  that  the  one  fram^ 
queueing,  tolerated  under  the  T?  =  25  msec  frame  scheme,  has  an  averaging  effect  on  the 
bursty  Poisson  arriving  traffic.  It  is  remarkable  to  see  that  further  partitioning  of  the 
frame  (by  a  factor  of  3  and  4)  does  not  lead  to  any  substantial  improvement;  this  is  due  to 
the  fact  that  the  P5c.l  confidence  constraint  limit'  the  utilization  to  lew  levels  and,  hence, 
the  probability  of  large  aueue  sizes,  requiring  more  than  two  frames  queueing,  is  very  small. 

In  Figure  8,  we  plot  the  simulation  derived  orobebilities  of  violating  the  50  msec 
constraint  for  the  (N,  XjP  pairs  computed  using  the  analytic  technique  described  above. 
These  probabilities  fall  below  the  5°f,  specification,  thus  indicate  that  our  analysis 
assumptions  are  valid  and,  even  more,  rather  conservative.  Finally,  the  histograms  of  tie 
aueue  size  distribution  are  plotted  in  Figure  9  for  sample  values  of  N  and  the  extreme  ease' 
K  =  1  and  K  =  4.  These  histograms,  obtained  from  the  10,000  frame  simulations,  show  that 
the  queue  size  distributions  have  similar  shapes  for  a  given  N  but  differ  for  different  N 
values.  Hence,  our  algorithm,  which  evaluates  for  every  choice  of  N  e  corresponding  spread 
factor  SF(N),  is  based  on  a  reasonable  assumption. 
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Note,  that  the  results  plotted  in  Figure  7  are  independent  of  the  time  critical  packet 
size  (equal  to  the  capacity  quantum  size).  Figure  7  can  be  used  as  a  v/orking  tool  in  order 
to  evaluate  the  minimum  allocation  per  FS  for  frame  sizes  T  =  50  msec  and  T  <_  25  msec.  It 
can  be  easilv  seen  from  the  figure  that  frame  lengths  shorter  than  25  msec  does  not  provide 
substantial  irrnrovement.  Ve  can,  therefore,  use  the  25  msec  frame  to  approximately 
evaluate  the  minimum  capacity  for  anv  value  T  <  25  msec.  As  an  example,  an  ES  with  time 
critical  packet  arrival  rate  A  =  150  packets/sec  and  frame  length  T  =  20  msec,  needs  under 
the  25  msec  curve  of  Figure  7,  200  ouanta/see  or  4  quanta  per  20  msec  frame.  This 
technique  relaxes  the  simplifying  assumption  (8)  of  integer  number  of  frames  to  the  P>q-  = 
50  msec  delay  specification.  In  case  that  no  other  constraints  are  imposed  (e.c.. 
synchronization,  etc.),  the  frame  length  should  be  set  eoual  to  /2=  ?5  msec  for  near- 

optimal  utilization  and  reduced  frame  fragmentation. 
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IV.  CONCLUSIONS 


The  various  satellite  aeoess  alternatives  for  the  time  critical  traffic  in  a  genera] 
purpose  satellite  packet  network  have  been  analyzed  and  evaluated.  The  stringent  end-to- 
end  delay  specification  has  imposed  a  50  msec  access  delav  wit!1  95°,'.  confidence.  Tvo 
general  categories  have  been  considered.  The  dedicated  subchannel,  first  category,  consists 
of  access  schemes  which  decouDle  the  time  critical  traffic  service  from  the  total  bulk- 
traffic  access  mechanism.  The  dedicated  subchannel  is  used  for  the  direct  access  of 
outstanding  time  critical  packets.  The  inevitablv  introduced  contention  should  be  kept  such 
that  the  delav  constraint  is  not  violated.  The  analysis  has  clearly  shown  that  slotted 
ALOHA  random  access  reduces  substantially  the  system  efficiency  over  the  dedieated  Tl'"-' 
subchannel  strategy.  Furthermore,  th°  TPV1  scheme  is  superior  ;n  terms  of  its  stability  and 
robustness  and  the  inherent  capability  to  provide  a  control  and  reservation  subchannel  for 
the  bulk  traffic,  ^he  bulk  Jew  priority  traffic  can  be  served  via  any  demand  assignment 
scheme  involving  a  traffic  adapting  control  mechanism,  e.g..  explicit  reservations.  The 
mechanism  of  decoupling  the  time  critical  component  of  the  traffic  is,  however,  associate'’ 
with  considerable  protocol  complexity  Rnd  involves  various  fragmentation  overheads. 

The  second  category  consist';  of  the  Priority  TDHA  scheme  which  provides  a  unifor-i 
method  for  ail  traffic  categories.  The  accessing  FF's  are  responsible  to  schedule  the  pendin’' 
traffic  within  their  transmitting  bursts  according  to  priority  rules.  A  centralized  cont"o! 
mechanism  partitions  the  transponder  capacity  into  dedicated  variable  size  t;me  slots, 
comprising  a  frame,  such  that  P5T  cf  the  outstanding  time  critical  traffic  vl!  be  giver 
access  within  50  n  sec.  An  v/n/N  queueirg  analysis  has  been  used  to  evaluate  the  mininur- 
capacity  assignments  which  guarantee  the  validity  of  the  delav  constraint  over  a  range  of 
anticipated  time  critical  traffic  rates.  A  frame  length  of  °5  msec  has  been  found  to  provide 


near  optimal  capacity  utilization.  Among  the  obvious  advantages  of  this  scheme,  we 
mention  its  stability  and  robustness,  single  logical  link  protocol  simplicity,  reduced 
fragmentation  overhead  and  flexibility  to  accommodate  on  Driority  the  ES  traffic  mix.  Its 
disadvantage  lay  on  its  rigiditv  in  real  time  adapting  to  fast  traffic  variations  among  ES's. 

As  a  general  conclusion,  we  feel  that  the  priority  TDVA  scheme  provides  a  simple  and 
attractive  access  candidate,  especiallv  in  the  case  of  ES’s  collecting  data  horn  a  laree  and 
relatively  fixed  terminal  population.  The  bursty  character  of  a  terminal  traffic  is  then 
everaged  over  the  entire  poouletion  and  does  not  dramatically  affect  the  transponder 
partition  requirements.  Non  real-time  bandwidth  adjustments  are,  of  course,  possible  via  p. 
centralized  reallocation  orotoeol. 


APPENDIX;  M/D/N  QUEUE  STATISTICS 


Let  T  be  the  frame  duration  and  ythe  time  critical  packet  arrival  rate  in  packets 'sec. 

t  ^ 

The  queue  state  q.  will  he  the  number  cf  packets  in  aueue.  just  after  the  opening  of  the  i 
frame.  With  Vj»  new  arrivals  generated  during  the  j^  freme  and,  at  most  N  packets  served 
per  frame,  8?  in  Figure  A.l,  the  queue  difference  equation  will  be: 


aj+i  =  '°j  -  N'  4  Vj 


where, 


0  if  a  <0 


a  if  a  >  0 


and  v -j  is  Poisson  distributed  with  moor  c  =  <  N.  From  the  v/P/N  queue  analvsis  in 

[12].  the  stationary  penorfitinc  function  of  the  queue  stete  probabilities  is  given  by. 


C'.'Z'i  = 


fy  -  pi  I?  -  i 


(A-n 


1.  -z  Fv  P  Ifh  -  7)] 


where,  z  are  the  (y-\)  root'.  iZ  I  <  1  of  the  equation. 

r  r 


7. 1  rvr  [pH  -  7)  ]  = 


(A-°i 


The  h.  root  of  FA-11  jc  the  obvious  zv  =  1.  A  simple  numerical  algorithm  is  given  in  [1?1 
for  the  calculation  of  these  roots.  V .'itk, 


X„  =  Re  [z  ] ,  Y  =  Im  lz  J 

r  r  r  r 


it  easilv  turns  out  that  v  ,  Y  satisfy  the  nonlinear  svstem, 
"  r  r 


Yr  =  FXP 


[2r  II  +  p  Y  "1 

- N - ~ 

f?r  n  +  pY  1 

[p/N  (Xp  -  1)  ]  SIN  I - w - 


From.  Eo.  (A-1 )  ve  pet.  after  some  alpebra,  the  fellow-in?  queue  size  statistics, 


Pr  {o  <  N  }  =  F  pfq  =  i)  =  ',v,  _  ~ - 

i  =  o  n  (i  -  2  i 


(A-3) 


Note,  that  this  is  the  probability  that  an  arrival  encounters  at  most,  N  -  1  packets  in  aueuc 
and,  hence,  yets  access  im  media  telv  into  the  next  frame.  The  mean  and  variance  cf  c  are 
yiven  by, 


E(c)  =  I 


O  O 

N  -  N  '  +  0  ‘ 


1  -  7.  ?(N  -  P) 


<A-') 


o  ?  =  C"(l >  +  P'(l)  -[  O’0>]  ? 

q 


(A-S) 


y?  +  p2  -  N  -  ?  ON 
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TABLE  Is  D  95  FOP  VARIOUS  SF  CHOICFS 


A  =  1  packet  /see  T  =  50  msec 


SF  =  4.5  (Chebyshev) 


=  98.1  msec 


SF  =  3.0 


90.0  msec 


SF  =  1.90  (Normal) 


=  77.9  msec 


SF  =  1.5 


=  50.2  msec 


TABLE  2:  MAXIMUM  TIME  CRITICAL  PACKET  RATE 


F^ame 

50  msec 

25  msec 

16.66  msec 

12.5 

msec 

N 

X1 

Pi* 

X  2 

P2% 

n 

P3% 

■ 

n 

SF 

1 

1 

4.72 

6 

1.20 

18 

1.60 

34 

1.00 

4.19 

2 

6 

3.70 

33 

1.88 

73 

1.40 

115 

1.16 

3.07 

3 

16 

4.66 

73 

3.20 

135 

1.38 

200 

3.08 

2.42 

4 

26 

3.94 

110 

2.74 

195 

2.18 

280 

7.20 

2.34 

5 

38 

4.32 

150 

2.98 

255 

2.56 

360 

3.82 

2.22 

6 

51 

4.66 

190 

2. 96 

315 

2.64 

440 

1.54 

2.13 

7 

64 

4.54 

225 

2.12 

375 

3.48 

520 

5.22 

2.09 

8 

78 

4.56 

265 

2.90 

435 

4.02 

600 

5.00 

2.04 

9 

92 

4.34 

305 

3.12 

495 

4.02 

680 

3.80 

2.02 

10 

106 

4.26 

346 

3.20 

556 

2.90 

761 

4.48 

2.01 

15 

181 

4.46 

546 

3.78 

856 

4.04 

1151 

4.68 

1.95 

20 

261 

4.54 

746 

4.23 

1151 

3.50 

— 

1.91 

25 

341 

4.73 

945 

3.87 

— 

— 

1.90 

30 

426 

4.89 

— 

— 

— 

— 

1.85 

35 

511 

4.70 

— 

— 

— 

— 

— 

1.84 

40 

596 

4.58 

— 

— 

— 

— 

— 

— 

1.84 

45 

686 

5.04 

— 

— 

— 

— 

_ z _ 

1.79 

X  K 


Maximum  Rate  (Packets/Sec) 


PK  =  Pr  {  q/K  >  N  }  Simulation 
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FIGURE  2:  S-ALOHA  SUBCHANNEL 
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FIGURE  5:  DEDICATED  TDM  CAPACITY  FOR 
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FIGURE  6:  PRIORITY  TDMA  SCHEME 


FIGURE  9:  SIMULATION  RESULTS:  QUEUE  SIZE  DISTRIBUTION  OVER  10,000  FRAMES 


FIGURE  A. 1 :  THE  M/D/N  MODEL 
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ABSTRACT 


Coaxial  cable  based,  broadcast  Local  Area  Networks 
(LANs)  are  widely  recognized  as  an  effective  means  to 
provide  both  communication  capabilities  to  a  variety  of 
office  automation  equipment  as  well  as  interoperability 
among  data  processing  configurations.  This  paper 
presents  the  results  of  a  performance  analysis  of  a  typical 
environment  to  which  both  of  these  factors  contribute. 
The  environment  includes  a  variety  of  line  speeds  and 
protocols  and  access  is  made  to  remote  data  bases  and 
hosts. 

The  network  carries  messages,  packetized  to  the 
optimal  packet  length  and  transmitted  via  the  CSMA/CD 
protocol  (Carrier  Sensing  Multiple  Access/Collision 
Detection).  End-to-end  delay  •  and  access  delay  of  the 
CSMA/CD  protocol,  is  studied.  It  is  shown  that  lor  a 
realistic  traffic  mix,  the  principal  contribution  to  end-to- 
end  delay  arises  from  the  queueing  and  NIU  processing. 

Sensitivity  of  the  performance  of  the  LAN  to 
variations  in  the  number  of  users  and  the  traffic  volume  is 
investigated.  This  analysis  yields  the  result  that  end-to- 
end  performance  is  not  sensitive  to  significant  increases 
(up  to  a  factor  of  ()  in  the  number  of  users  and  the 
associated  traffic  volume  above  the  relatively  high 
volumes  used  in  the  reference  scenario. 


I.  INTRODUCTION 

Recent  developments  in  intra-building  or  intra¬ 
campus  communications  indicate  that  local  networking  is 
taking  a  path  distinct  from  traditional  long  haul 
communications  and  computer-to-peripheral  data  bases. 
The  unique  environment  of  local  networks  in  terms  of 
geographical  dispersion  (from  100  meters  to  1C  Km), 
transmission  speed  (between  9  Kbps  to  50  Mbps), 
unregulated  environment  and  support  of  dissimilar  devices 
and  protocols  is  summarized  in  [1]  .  Interoperability 
among  various  user  requirements  is  the  key  factor  to  the 
development  of  local  networks,  often  imposing  the  major 
cost  and  performance  constraints  instead  of  bandwidth, 
which  was  the  typical  scarce  commodity  in  a  more 
conventional  communications  environment. 

From  the  early  R4rD  efforts  in  Local  Area  Networks 
(LAN),  it  became  apparent  that  less  controlled  random 
access  methods  could  be  very  efficient  for  high-speed 
broadcast  media.  This  led  to  the  Carrier  Sensing  Multiple 
Access  protocol  with  Collision  Detection  (CSMA/CD)  as 
first  introduced  in  [2j  .  In  parallel,  experiments  showed 
that  off-the-shelf  cable  TV  technology  represents  an 
excellent  vehicle  for  implementing  broadcast  LAN  [3]  . 


More  recently,  the  data  communication  and  office 
automation  users  community  has  shown  great  interest  in 
local  networking.  This  interest  was  strongly  influenced  by 
extensive  vendor  activity,  new  hardware/software 
announcements,  the  IEEE  standards  committee  for  LAN, 
and  the  intensive  advertising  campaign  in  the  mass  media. 
The  technical  and  scientific  literature  is  flooded  by 
articles  with  the  main  controversial  issues  being  the 
transmission  method  (baseband  versus  broadband/RF)  [4]  , 
[5]  and  network  access  scheme  [6]  ,  [7]  .  Most  of  the 
analytical  work  is  dealing  with  performance  comparisons 
[8] ,  [9]  of  the  various  proposed  channel  partition 
methods,  in  particular  the  two  that  have  been  blessed  as 
IEEE  standards  by  the  LAN  committee,  Carrier  5ense 
Multiple  Access  with  Collision  Detection  (CSMA/CD)  and 
the  Token  Ring  [10]  . 

The  97%  channel  utilization  capability  of 
ETHERNET  reported  by  Shoch  and  Hupp  [11]  ,  completely 
ignores  the  processing  time  at  the  Network  Interface  Unit 
(NIU),  the  interaction  between  the  CSMA/CD  access  and 
higher  level  protocols,  buffer  management  at  NIU,  and 
the  nodes  speed  mismatch.  Watson  [12]  performed 
simulations  of  two  CSMA/CD  type  networks  with 
different  methods  of  buffer  management  and  speed 
mismatch  and  concludes  that  network  performance  is 
stongly  dependent  on  these  factors,  sometimes  reducing 
utilization  to  Jess  than  50%.  In  [13]  ,  we  presented  a 
detailed  modeling  of  two  microprocessor  architectures  for 
local  network  interface  adapters  and  analyzed  their 
impact  on  throughput/delay  performance  on  end-to-end 
(ETE)  character  transfer.  Our  studies  showed  that  the 
main  limitation  to  LAN  performance  is  due  to  the 
interlace  processing  and  qeueing,  while  its  sensitivity  to 
the  coaxial  cable  speed  and  access  protocol  is  rather 
limited. 

In  this  paper,  we  do  not  attempt  to  model  closely 
the  NIU  architecture  at  the  microprocessor  level.  Having 
established  in  [l  3]  that  the  bottleneck  exists  on  the 
protocol  processing  and  the  Packet  Assembly/Disassembly 
(PAD),  we  assess  at  the  macroscopic  level  the  ETE  delays 
lor  a  multiple  applications  scenario,  typical  of  an  office 
automation  environment.  The  NIU  is  modeled  as  a  single 
server  queue,  where  messages  are  input  from  a  Data 
Terminal  Equipment  (DTE),  (e.g  ,  a  TTY,  a  minicomputer 
port,  ?  word  processor,  a  digital  FAX  machine), 
packetized  appropriately  and  transmitted  to  another  NJU 
via  the  CSMA/CD  protocol.  Both  packet  and  message 
delays  *re  analyzed  among  various  types  ol  DTE's.  Before 
proceeding  in  describing  the  mathematical  models  and 
presenting  the  experiments,  we  establish  in  a  rigorous 
manner  the  delay  definitions: 


A2.3.1 
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Packet  Delay:  From  the  time  the  first  oit  o! 

a  packet  is  ready  to  be  sent 
at  the  source  NIU  to  the  time 
the  last  bit  is  received  at  the 
destination  NIU. 

Message  Delay:  From  the  time  the  first  bit  of 
the  message  is  ready  to  be 
sent  over  the  source  DTE 
access  channel,  to  the  time 
the  last  bit  of  the  message  is 
received  at  the  destination 
DTE. 

(A  message  is  typically  chopped  into  several  packets 
which  may  be  interleaved  as  they  travel  through  the 
network). 


The  aggregate  class  i  packet  arrival  rate  summed 
over  all  DTE's  transmitting  class  i  packets,  is  denoted  by 
Xj.  It  follows  that  the  total  cumulative  packet  rate  S, 
normalized  to  T j  is 

S:T,  Zxi 


S  is  related  to  the  total  offered  packet  traffic  C, 
including  retransmissions  due  to  collisions  and  busy 
conditions,  as 

Ce-aG 

S  =  - 

(X  ♦  a)  Ce‘aG  ♦  (1  ♦  aG)  (1  -e  -aG)2  +  | 


In  what  follows,  we  first  introduce  the  models  for 
the  packet  and  the  message  delay  analysis.  We  then 
describe  the  data-base  of  the  office  building  scenario  and 
subsequently  present  results  and  sensitivity  analysis. 

2.  MATHEMATICAL  MODELING 


Packet  Delay  Analysis 


where  X  is  the  packet  length,  weighted  by  the  class  mix 
and  normalized  to  Tjt 

E  XiTj 
i 

X  -  - 

Tl£  Xj 
) 


The  packet  delay  will  consist  of  the  transmission 
time  the  packet  spends  in  the  coaxial  cable  and  the  waste 
and  back-off  delays  due  to  busy  condition  or  collisions. 
We  follow  the  approximate  CSMA/CD  analysis  presented 
in  [1«]  ,  adapted  to  reflect  various  classes  of  packet 
lengths  depending  on  the  application  type. 

Let  Tj  be  the  transmission  time  of  packet  class  i, 
(including  all  link  level  overhead)  and  y  be  the  propogation 
delay  (in  nsec/ft).  Then  the  normalized  propagation  delay 
relative  to  packet  size  of  class  1  is 

y  •  L 

a  - - 

Tl 

where  L  is  the  maximum  cable  length.  This  assumption  is 
a  pessimistic  one,  since  it  implies  that  all  packet 
transmissions  will  encounter  the  maximum  possible 
propagation  delay. 


tamvAL  or 

COLltXNS  PACKET 


FIGURE  1:  ILLUSTRATION  OF  THE  AVERAGE  WASTED 

interval  due  to  collision  w 


The  packet  delay  per  class  i  will  be 
Dj  =  (A]  ♦  A2  ♦  A3)  Tj 

when  Aj  is  the  waste  due  to  collisions,  A2  the  dead  time 
due  to  the  back  off  algorithm  in  case  of  collision  or 
carrier  busy  and  as  the  propagation  and  transmission  time 
(all  normalized  to  T  | ).  It  can  be  shown  that 

Aj  =  N2  (N  ♦  a) 


A2  = 


—  (2  N2  *  1  - 
Tl 


A3  =  a 


Rl 

1)*(N1-N2) - 

Tl 


where 


G 


ae*aG 


\ 

(the  waste  due  to  collisions, 
Figure  1) 

(the  average  number  a  packet 
has  to  back-off) 


N2  *  (I  ♦  aG)  e*aG  -  1  (average  number  a  packet 
collides) 


R  I,  R2  are  the  mean  retransmission  intervals  in 
case  of  busy  condition,  or  collision  in  which  case  the 
binary  back-off  algorithm  is  applied  as  in  [2]  . 


Message  Delav  Analysis 


This  delay  will  consist  of  the  serialization  delay  at 
the  DTE  port,  the  protocol  execution  time  at  the  source 
and  destination  NIU,  the  CSMA/CD  packet  delays  as 
computed  above  and  queueing  delays  at  the  source  (  a 
packet  waits  to  be  transmitted)  and  destination  (packets 
wait  to  be  reassembled  into  a  message  to  be  transmitted 
over  the  serial  DTE  port). 


A2.3.2 


Several  types  of  DTE's  will  be  considered,  each 
transmitting  or  receiving  various  classes  ol  messages. 
Let 


Oi  (j,  k) 

The  ETE  delay  of  message  class  i 
from  a  DTE  of  type  j  to  a  DTE  of 
type  k 

tn 

Protocol  execution  time 

Dj 

Packet  delay  due  to  CSMA/CD 

TRj  (j) 

Class  i  packet  transmission  time 
and  type  j  DTE 

Mi 

Average  number  of  packets  per 
message  of  class  i 

QiIN(j) 

First  packet  queueing  of  class  i 
messages  on  access  queue  of  DTE 
type  j  port 

QjOUT(j) 

Packet  queueing  of  class  i,  at 
destination  queue  of  DTE  type  j 
port 

In  case  the  source  DTE  is  a  low  speed  DTE  or  host 
connected  via  multiple  parallel  ports,  then  QjlN(j)  =  0, 
since  no  queueing  is  practically  taking  place  at  these 
access  ports.  For  DTE's  connected  via  a  high  speed  port, 
the  NIU  may  have  to  buffer  incoming  messages  to  avoid 
overflow.  To  compute  QjlN(j),  we  note  that  a  message 
will  be  input  to  the  NIU  as  a  whole.  Thus,  the  M/M/l 
formula  on  a  message  basis  can  be  used  (assuming  Poisson 
message  arrivals  and  exponented  message  length)  to 
compute  the  message  queueing  reflected  at  the  first 
packet. 

At  the  destintion  DTE  packets  belonging  to  a 
message,  will  not  follow  one  another  at  regular  intervals 
and  may  be  interleaved  with  other  message  packets.  We 
assume  that  received  packets  are  Poisson  distributed  and 
have  exponential  length.  Thus  the  M/M/ 1  formula  on  a 
packet  basis  can  be  used  to  provide  QjOUT(j). 

To  compute  the  ETE  message  delay  we  distinguish 
between  two  cases. 

Case  1  -  The  delay  bottleneck  occurs  at  the 
destination  NIU.  This  will  happen  if  the  source  channel 
speed  is  higher  than  the  destination  channel  speed.  In  this 
case,  the  message  delay  will  be  dominated  by  the  packet 
queueing  and  transmission  time  at  the  destination  DTE 
port,  M,  rQjOUT(i<)  4  TRj(k)].  Other  components  to  the 
ETE  delav  include  message  protocol  processing  2Tjsj,  the 
CSMA/CD  delay  Dj  and  the  first  packet  queueing  and 
transmission  over  the  source  port,  as  illustrated  in  Figure 
2a.  It  follows  that 

D,  (j,  kla  Q.’Nfj)  ♦  TRj  <j)  ♦  2TN  4  Di  ♦ 

♦  Mj  [Qi°UT(k)  ♦  TRj  (k)] 
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DESTINATION 


B.  SOURCE  PORT  SPEED  <  DESTINATION  PORT  SPEED 


FIGURE  2:  ILLUSTRATION  OF  ETE  MESSAGE  DELAY 

3.  REQUIREMENT  ASSUMPTIONS 

The  models  above  have  been  used  to  evaluate 
deJay/throughput  characteristics  for  a  typical  office 
building  LAN.  The  maximum  length  of  the  single  coaxial 
cable  was  assumed  5,000  feet  ^enough  to  span  a  20  story 
average  office  building).  The  LAN  was  assumed  to 
support  1,000  terminals,  10  minicomputers,  25  word 
processing  centers  (each  including  several  CRTs  and 
floppy  disk  drivers),  and  20  digital  FAX  units. 
Communications  with  the  outside  world  were  supported 
via  a  high-speed  gateway  C 1 .544  Mbps)  fo  some  long-haul 
T  |  -  rate  line. 

In  Tables  1,  2,  and  3  we  summarize  the  device 
characteristics  and  the  traffic  volumes  they  feed  to  the 
network,  the  message  tvpes  and  the  routing  of  messages 
both  internally  and  to  the  gateway.  Note  that  traffic 
numbers  are  purposely  exaggerated.  For  example  a  CRT 
terminal  is  assumed  to  generate  120  messages/hour,  each 
message  approximately  consisting  of  kO  characters.  This 
corresponds  to  an  operator  being  able  to  type  SO 
characters/min.  continuously  without  waiting  for  a 
response!  Similarly  the  number  of  devices  in  the  building 
exceeds  current  office  automation  needs. 


Case  2  -  The  delav  bottleneck  is  the  source  NIU. 
This  occurs  if  the  source  channel  speed  is  lower  than  the 
destination  channel  speed.  Here  the  message  residence 
(ETE  delay)  will  be  dominated  by  the  transmission  time  at 
the  source  port  Mj  TRj  (j)  as  in  Figure  2b.  Thus 

Dj  (j,  k)  =xQ,lN(j)  .  Mj  TRj  (j)  ♦  2Tn  ♦  Dj  ♦ 

*  [QjOUT(k)  ♦  TRj(k)] 


U.  NUMERICAL  RESULTS 

The  packet  and  message  delay  analysis  were 
implemented  in  a  computer  package  called  PLAN 
(Performance  of  Local  Area  Networks).  This  program 
accepts  as  input  a  Data  Rase  on  requirements  in  a  format 
similar  to  Tables  I,  2  and  3  and  outputs  packet  and 
message  ETE  delays  per  device-type  pair. 
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TABLE  I:  DEVICE  CHARACTERISTICS 


TABLE  V  ROUTING  ASSUMPTIONS 


Device  Type 

Access 

Number 

(Kbps) 

Message 

Speed 

(Msg/hour) 

Rate 

1. 

Gateway  (GW) 

1 

1,544.0 

■ 

2. 

Word  Processing 

25 

9.6 

BS3 

3. 

Asynch.  Terminals 

600 

1.2 

H 

4. 

BSC  Terminals 

400 

9.6 

120 

5. 

Mini  Computers 

10 

(•*) 

(••) 

6. 

Low  Speed  FAX 

10 

1.2 

5 

7. 

High  Speed  FAX 

10 

9.6 

50 

(•)  The  gateway  traffic  is  evaluated  from  the  message 
routing  information  (Table  3). 


(••)  Two  configurations  are  considered: 

(1)  Multiple  Parallel  ports  per  mini 

File  transfer  ports  at  36  Kbps, 
generating  5  messages  (files)Zhour. 

-  Asynch.  ports  at  1.2  Kbps.  The  message 
volume  to/from  these  ports  is  computed 
from  the  routing  information. 

BSC  ports  at  9.6  Kbps.  The  message 
volume  to/from  these  ports  is  computed 
from  the  routing  information. 

(2)  Single  Multiplexed  port  at  128  Kbps. 


From 

To 

wp 

GW,  WP  (50%,  50%) 

TERMINAL 

GW,  MINI,  TERMINAL  (75%,  12.5%, 
12.5%) 

FAX 

GW  (100%) 

MINI  (file  XFER) 

GW,  MINI  (70%,  30%) 

MINI  (other) 

TERMINAL  (symmetric  to  other 
direction) 

GW 

FAX  (3  times  the  other  direction) 

GW 

MINI,  TERMINAL,  WP  (symmetric) 

Two  classes  of  experiments  were  performed  using 
PLAN.  The  first  class  included  studies  on  effects  of 
network  load,  cable  speed  and  packet  length  to  the  packet 
access  (CSMA/CD)  delays.  The  second  class  consisted  of 
experiments  on  ETE  message  delays  and  their  sensitivity 
to  various  factors. 

A.  Experiments  on  Packet  Delay 

In  Figure  3  we  plotted  the  average  packet  delay  for 
the  data-base  of  section  3  (hereafter  referred  to  as 
reference  scenario)  under  various  packet  length  choices. 
Very  small  packet  sizes  (less  than  2  Kbits)  result  into 
considerable  message  fragmentation  and  consequently 
large  protocol  overhead.  On  the  other  hand,  larger  packet 
sizes  result  into  unfairness  (a  single  packet  will  take  over 
the  cable  for  a  prolonged  time),  need  for  large  buffer 
sizes  and  incompatibilities  with  the  device  access  method. 
Thus,  the  optima)  choice  for  a  packet  length  as  indicated 
in  Figure  3  is  4  Kbits  above  which  no  significant 
performance  improvement  is  achieved.  Note  that  if  a 
maasag*  is  less  than  4  Kbits  it  will  form  a  single  packet 
for  transmission. 


TABLE  2:  NETWORK/TRAFFIC  PARAMETERS 


Packet  Header 
Cable  Capacity 
Cable  Length 
Retransmission 
Protocol  Processing 


236  bits 
1  •  10  Mbps 
5000  feet 
10  Packets 
40  sec/bit  (•) 


Message  Class 

Length  (Information  Kbits) 

Terminal  Input 
Asynch  Output 
BSC  Output 

Word  Processing 
File 

FAX  (page) 

0.48 

2.00  (••) 
8.00  (**) 
16.00 

200.00 

1,000.00 

•  As  assessed  in  |]1 30  . 

••  The  output  to  the  terminal  will  be  much  larger  than 
what  the  operator  types  in. 


FIGURE  3:  EFFECT  OF  PACKET  LENGTH  (INFO  BITS ) 


ON  THE  AVERAGE  PACKET  DELAY 


In  Figure  4,  the  packet  delay  is  given  as  a  function 
of  the  aggregate  load,  varying  up  to  3  Kbps  (the  cable 
capacity).  Our  reference  scenario,  generates 

approximately  300  Kbps.  Thus  the  cable  is  utilized  only 
at  10%  and  the  margin  for  growth,  insensitive  to  packet 
delays,  is  up  to  63%  utilization  (2  Mbps).  This 
demonstrates  that  the  access  method  is  not  a  significant 
factor  to  delays  under  a  wide  range  of  overload  scenarios. 
Note  that  our  reference  scenario  is  by  far  exaggerating 
traffic  volumes  in  a  typical  office  building. 


RCTKI4U  1.6*  SEC 

FIGURE  4:  EFFECT  Of  BUS  LOAD  ON  PACKET  DELAY 
B.  Experiments  on  Message  Delays 

In  Table  4  we  depict  the  ETE  message  delays  under 
the  parallel-port  and  single-multiplexed  minicomputer 
attachment  (as  expected,  the  single  high-speed  port  at 
128  Kbps  performs  better  than  parallel  ports  although  the 
basis  of  comparison  is  rather  weak).  Note  that  delay  can 
be  rather  large  (of  the  order  of  minutes)  due  to  the  device 
access  speed  and  queueing.  This  is  further  demonstrated 
in  Figure  5,  whereby  queueing  at  the  access  port  of  Word 
Processing  centers  (WP)  results  into  considerable  delay 
degradation  as  message  rate  increases.  Again  for  our 
reference  scenario,  and  its  vicinity,  delays  are  rather 
insensitive.  In  Figure  6,  we  show-  the  dramatic  effect  of 
the  single-multiplexed  port  speed  on  minicomputer  traffic 
as  it  results  from  the  reference  scenario. 

Subsequent  experiments  showed  that  there  is  no 
significant  sensitivity  on  the  number  of  devices,  assuming 
that  the  aggregate  load  remains  constant. 

5.  CONCLUSIONS 

In  summary  we  conclude  that  the  CSMA/CD  access 
delay  is  of  the  order  of  miliseconds,  orders  of  magnitude 
under  the  message  ETE  delays.  These  are  due  mainly  to 
limited  device  access  speeds  and  queueing  at  the  serial 
port  at  the  OTE-NIU  interface.  Furthermore,  it  was 
established  that  coaxial  cable  LAN's  can  support 
extensive  office  automation  requirements  up  to  six  times 
a  reference  scenario,  already  overloaded  without  any 
significant  performance  degradation. 


Note  that  the  LAN  technology  can  offer  a 
multiplicity  ol  the  3-10  Mbps  capacity  either  via 
internetting  among  various  coaxial  cable  structures  or  by 
exploring  the  virtually  unlimited  capability  of  RF 
frequency  division  multiplexing  in  the  VHP/UHF  CATV 
bands. 


TABLE  4:  END-TO-END  MESSAGE  DELAY  (SEC) 


BEP0CMCE  SCENARIO  •  MULTIPLE  WATVMM 
It  MBPS  CHANNEL 


C*  VP  MM  ATTN  IX  MVP  A*  LVPAX 

cv  ...  i.J  tt.t  l.t  M  in  n 

VP  l.t  X.l  —  —  —  —  — 

MM  It.t  —  n.«  ).l  l.t  —  — 

A SVN  It  —  I.)  ».l  l.f  -  — 

BSC  t.tt  —  I.I  t.’  t.I  —  — 

MVP  AK  IPS  —  -  -  —  - 

LVPAJI  X7  —  —  -  -  —  — 


acpauci  sc  cm  a*  in .  vmclc  povt/mm 
It  MBPS  CHANNEL 


cv 

VP 

MM 

ASTN 

BSC 

HVPAX 

LVPAX 

ct 

t.t 

It.t 

l.t 

t.t 

IPP 

n 

VP 

l.t 

t.I 

— 

M. 

MINI 

•  .1 

It.t 

l.t 

a.t 

— 

ASYN 

«.< 

t.P 

1.1 

a.  t 

— - 

... 

BSC 

a.tt 

_ 

I.I* 

t.P 

t.t 

— 

... 

HVPAX 

ITT 

_ 

— 

... 

LVPAX 

n 

— 

— 

— 

... 

... 

FIGURE  5:  EFFECT  OF  MESSAGE  RATE  ON  END-TO- 

END  MESSAGE  DELAY:  WORD  PROCESSING 
AND  BSC-ASYN  TERMINALS 
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TRAFFIC  LOADi  REFERENCE  SCENARIO 
CABLE  SREEO:  3-10  MBPS  (INSENSITIVE) 


FIGURE  6:  EFFECT  OF  PORT  SPEED  ON  MINI-COMPUTER 
MESSAGE  ETE  DELAY:  SINGLE  PORT 
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ABSTRACT 

This  paper  deals  with  analysis  and  design  issues 
encountered  in  the  development  and  evaluation  of 
microprocessor-based  interface  adapters  to  broadcast 
carrier  sensing  multiple  access  (CSMA)  local  computer 
networks.  Queueing  network  models  are  presented  and 
approximate  analytic  techniques  are  used  to  obtain 
throughput/delay  measures  associated  with  the 
interface  processors. 

In  the  first  part  of  the  paper,  the  protocol 
partition  issue  is  studied  as  it  is  implemented  via  two 
stage  architectures.  It  is  shown  that  off-loading  link 
level  functions  to  the  terminal  end  CPUs,  improves 
dramatically  the  system  throughput  versus  partitions 
between  separate  intelligent  boards  at  .ne  terminal  and 
the  network  ends  of  the  interface  unit.  In  the  second 
part,  a  closed  queueing  network  model  is  used  to  study 
the  interface  throughput  under  window  flow  control.  It 
is  shown  that  the  main  limitation  is  due  to  protocol 
processing,  while  performance  is  unaffected  by 
parameters  referring  to  the  network  overall  utilization 
and  speed  within  a  wide  region  of  operation. 

1.  INTRODUCTION 

Local  area  networks  (LANs)  are  rapidly  emerging 
as  a  major  and  distinct  class  ol 
computer  communications  networks.  Their  salient 
characteristics  make  them  particularly  attractive  as  an 
efficient  and  economic  solution  to  high  speed, 
unregulated  connections  within  a  limited  distance 
geographical  area  £l]  .  From  the  early  R&D  efforts,  it 
became  apparent  that  random  access  protocols  could  be 
more  efficient  for  high  speed  broadcast  media  over 
conventional  contention  resolution  techniques  used  in 
long  haul  networks.  This  led  to  carrier  sense  multiple 
access  protocols  (CSMA),  such  as  the  Ethernet  access 
scheme  [2] ,  employing  collision  detection  and  the 
hyperchannel  access  method  [3]  ,  a  hybrid  between 
CSMA  and  polling.  Another  class  ol  access  protocols 
use  token  passing  techniques,  a  variation  of  hub 
polling  (V) . 
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Various  design  issues  have  been  raised  by  LAN 
vendors  and  users.  Mainly  concerning  access  protocols, 
network  topology,  and  transmission  technology.  On 
access  protocols,  extensive  study  have  demonstrated 
relative  merits  of  the  two  prevailing  techniques,  CSMA 
with  collision  detection  and  token  passing,  e.g.  [5]  , 
[6]  ,  and  [7]  .  Among  topological  designs,  bus  (tree) 
networks  are  preferred  for  CSMA  protocols  and  rings 
for  token  passing.  Finally,  a  major  "dichatomy"  on 
transmission  technology  exists  between  modulated  RF 
broadband  cable  transmission  (employing 
CATV  components)  and  baseband  tapping  on  a  passive 
coaxial  cable.  All  the  above  issues  could  be  classified 
as  referring  to  the  lower  protocol  functionsjwhich  refer 
to  the  physical  layer  in  the  ISO  model  [8]  .  As  a 
consequence,  the  IEEE  LAN  Standards  Committee  [9] 
came  up  with  a  draft  proposal  which  includes  all  the 
major  physical  layer  options  (e.g.  bus,  CSMA/CD,  ring  - 
token  polling,  bus  -  token  polling)  and  builds  link  and 
network  level  standards  independent  of  the  physical 
layer.  This  recognizes  the  fact  that  other  protocol 
issues  may  become  more  important  than  the  access 
method  and  transmission  technology,  especially  in  view 
of  the  ample  bandwidth  (high  speed)  available  in  LANs. 

Immediately  related  to  the  protocol  function  in 
LANs  is  the  architecture  of  the  network  interface  unit 
(NIU),  within  which  most  of  the  the  lower  layers  are 
implemented.  These  multiple  microprocessor  units 
have  to  interface  a  variety  of  data  terminal  equipment 
(DTE),  including  high  speed  terminals,  multi-vendor 
hosts,  etc.  They  differ  from  conventional 
communication  processors,  especially  on  the  very  high 
output  speed  (e.g.  10  Mbps)  to  be  achieved  at  a  cost 
justifying  their  existence.  For  example,  on  a  per  port 
basis  an  NIU  supporting  asynchronous  terminals  (DTEs) 
should  not  exceed  the  cost  of  a  short  haul  modem. 

It  was  long  suspected  that  throughput  analysis 
ignoring  processing  times  at  the  NIU  and  interaction 
between  higher  level  protocols  failed  to  identify 
potential  bottlenecks,  more  severe  than  the  limits  of 
the  access  schemes  and  the  medium  speed.  As  an 
example,  the  97%  Ethernet  utilization  reported  in  [lOj 
ignores  the  limitations  of  the  NIU  architecture.  On  the 
other  hand,  simulation  results  on  Ethernet  type 
networks  with  different  buffer  management  and  speed 
mismatch  [l  l]  ,  showed  that  network  throughput  is 
very  sensitive  to  those  factors,  sometimes  reducing 
utilization  to  less  than  50%.  In  [|2]  we  presented  a 
detailed  queueing  network  model  o(  two  microprocessor 
architectures  lor  interfacing  clusters  of  terminals  to  a 
coaxial  cable  based  CSMA/CD  network.  Using  queueinR 
network  modeling  and  similar  analytical  methods  used 
in  performance  evaluation  of  computer  systems,  we 
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studied  the  effect  of  the  NIU  on  end-to-end  cfiar.it  ter 
transfer.  Our  studies  showed  that  the  mam  limitation 
to  LAN  performance  is  due  to  the  interface  processing 
and  queueing,  while  its  sensitivity  to  the  cable  speed 
and  access  protocol  is  rather  limited.  In  |J  j]  »t 
further  demonstrated  via  approximate  analytic  tools, 
that  for  realistic  office  automation  scenario,  the  delay 
bottleneck  exists  at  the  DTE/network  interface. 

These  studies,  apart  from  identifying  a  serious 
performance  limitation,  helped  us  to  acquire  a  valuable 
analysis/design  tool  to  study  NIU  architectures.  Their 
variations  in  the  design  variables  predict  weak  links  ana 
often  system  parameters  which  are  important  to  the 
evaluation  of  network  performance.  In  this  paper,  we 
first  summarize  the  trade-off  between  the  two 
architectures  studied  in  [l  ^  .  It  is  shown  that  clearly 
one  class  of  NIU  designs  is  superior  to  the  other.  An 
improved  architecture  (within  the  best  class)  is  then 
outlined  and  an  analytic  tool  is  presented  to  model  and 
evaluate  the  performance  of  such  designs.  Finally, 
sample  numerical  results  are  provided,  along  with 
sensitivity  studies  on  NIU  throughput  and  delays  or 
functions  of  critical  processing  times  and  protocol 
parameters. 

2.  TRADE-OFF  STUDIES  ON  TWO  NIU 

ARCHITECTURES 

In  most  implementations,  NIU's  consist  of  two 
basic  units:  (1)  the  terminal  interface  unit  (T1U)  and  (2) 
the  transceiver  unit  (TRU).  The  former  serves  the  user 
end  of  the  interface,  while  the  latter  the  network  end. 
In  our  model  we  assumed  that  each  NIU  includes  up  to 
four  TIU  boards;  every  TIU  can  service  two  to  eight 
FOX  ports  at  speeds  up  to  9.6  Kbps,  as  shown  in  Fig.  1. 
Each  TIU  has  a  dedicated  8  bit  N.MOS  microprocessor 
with  a  1  microsec  minimum  instruction  cycle,  like  8085, 
6809,  or  Z80.  The  TIUs  and  the  TRU  are 
interconnected  via  a  parallel  master  bus,  having  a  speed 
equal  to  the  coaxial  cable  transmission  speed  (4  or  10 
Mbps.) 

The  two  architectures  referred  to  as  architecture 
A  and  architecture  B,  differ  at  the  TRU  block: 

(a)  Hardware 

Type  A  TRU  has  a  CPU  and  memory  on 
board,  which  controls  the  DMA  transfer 
to/from  the  TIUs  and  an  HDIC  LSI  chip 
controller,  serviced  by  the  CPU. 

Type  B  TRU  has  no  CPU  or  memory  on 
board.  The  DMA  transfer  to/from  TIUs  are 
controlled  by  the  TIUs  and  an  arbitration 
logic.  The  frame  functions  are  performed 
by  MSI  hardware  (such  as  CRC  and  flag 
generations,  carrier  sensing,  etc.) 

(b)  Protocol  Level  Partition 

In  NIU  Type  A,  all  physical,  link,  and  partly 
network  level  functions  are  pet  formed  by 
the  TRU  with  the  TIU  perfcming  only 
packet  assembly/disasscrnbly  (PAD) 
routines.  In  NIU  type  B,  tre  TRU  is  merely 
performing  physical  level  functions  (except 
the  backoff  timer)  and  no  link  functions. 


2.1  NIU  Modeling 

In  order  to  analyze  the  performance  of  the  two 
architectures,  we  modeled  each  CPU  board  (the  TIUs 
and  the  TRU  for  NIU-A)  as  a  central  server  network 
with  priorities  and  feedback  passes  through  the  central 
queue.  The  internal  parallel  bus  of  a  CPU  board,  which 
is  exclusively  required  at  every  processing  pass,  is  the 
"commodity"  for  which  requests  are  queued-up.  As  an 
example,  consider  the  TIU  queueing  model  for  NIU-A  as 
give  in  Fig  .2.  Packets  from  the  terminals  enter  the 
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B1U  at  rate  \jn  and  after  experiencing  the  serialization 
delay  Din  hold  the  parallel  bus  for  the  duration  of  the 
input  interrupt  processing.  A  second  pass  through  the 
bus  models  the  PAD  processing  functions  and  then 
packets  reside  in  RAM  waiting  to  be  fetched  by  the 
DMA  controller.  If  the  DMA  transfer  is  done  on  cycle 
stealing,  it  will  be  transparent  to  the  bus  functioning 
apart  from  slowing  down  background  processing. 
Otherwise,  the  DMA  transfer  is  modeled  as  a  third  pass 
with  pre-emptive  priority  over  background  processing. 
Similar  passes  will  occur  on  packets  routed  from  the 
TRU  to  one  of  the  TIU  serial  ports.  For  a  detailed 
specification  oi  the  packet  flow  in  the  various  stages 
and  the  processing  times  of  each  pair  through  the 
central  server  the  reader  ,s  referred  to  [l  l]  . 

To  integrate  two  various  stages  into  the  NIU 
model,  two  approaches  are  taken  depending  on  the 
architecture: 
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For  NIU  A  -  the  queueing  networks  representing 
TIUs  and  TRU  are  interconnected  via  a  polling 
mechanism  which  simulates  the  DMA  controller, 

as  in  Fig.  J-A. 

For  NIU  B  -  the  TRU  is  modeled  as  a  FCFS  queue 
with  service  rate  equal  to  the  coaxial  cable  speed. 
The  server  will  be  disabled  (switch  in  oil  position, 
Fig.  3-B)  if  the  carrier  on  the  network  is  sensed 
busy.  The  BIUs/TRU  coupling  via  arbitration  is 
also  modeled  as  a  FCFS  mechanism. 


2.2  Results  and  Conclusions 


The  models  were  used  to  run  experiments  and 
perform  sensitivity  studies.  When  the  two 
architectures  are  lightly  leaded,  they  provide  similar 
delay  performance.  As  traffic  increases  N1U-A 
saturates  at  the  TRU  CPU  for  link  level  processing. 
For  NIU-B,  saturation  occurs  due  to  contention  among 
TIUs  to  access  the  TRU  pipeline. 

For  the  set  of  parameters  used  and  for  packets 
including  only  1  character  of  information  (10  bits),  the 
analysis  showed  (Fig.  ft)  that  NIU-B  saturated  at 
terminal  traffic  of  3,000  characters/min  whereas  NIU- 
A  saturated  at  150  characters/min  only.  Clearly,  NIU- 
A  suffers  from  the  separation  of  link  protocol  functions 
between  TIU-TRU  in  series  and  the  single  TRU  is  the 
bottleneck  of  the  system.  NIU-B  achieves  the  highest 
possible  degree  of  parallel  processing  among  TIUs,  with 
the  TRU  acting  as  a  mere  transceiver.  Thus  results  the 
15-fold  improvement  over  NIU-A.  The  single  character 
per  packet  requirement  was  imposed  from  real  echo 
FOX  specifications  and  rendered  throughput  calculation 
quite  pessimistic.  The  results,  however,  are  very  useful 
for  comparative  purposes. 
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FIGURE  «:  COMPARATIVE  NIU  PERFORMANCE 


Another  interesting  study  concerns  the  sensitivity 
on  NIU  throughput  and  delay  from  variation  of  the 
network  (coaxial  cable)  utilization.  As  seen  in  Fig.  ft, 
the  NIU  performance  is  almost  insensitive  to  network 
utilizations  from  10  up  to  ft0%.  Higher  utilizations 
result  into  excessive  carrier  busy  and  collision 
conditions  and  the  performance  of  the  system  degrades. 

In  conclusion,  it  appears  that  the  main  limitation 
on  LAN  performance  is  due  to  interface  processing  and 
queueing,  while  its  sensitivity  to  the  cable  speed  and 
access  protocol  is  rather  limited.  Furthermore, 
performance  is  greatly  affected  by  the  protocol 
partition  among  TIU  and  TRU  with  bottlenecks  arising 
if  the  three  lower  layers  (physical,  link  and  network) 
are  allocated  in  both  ends.  A  pipeline  TRU  requite-, 
customized  hardware  and  lacks  the  flexibility  in 
adapting  to  different  access  protocols  through 
software,  but  exhibits  impressive  improvement  on  the 
NIU  throughput.  In  the  next  section,  we  show  how  to 
approach  the  modeling  and  analysis  of  3  more 
sophisticated  version  of  NIU-B,  not  limited  to  Singh- 
character  packets  and  incorporating  window  I  low 
control  to  keep  memory  sizes  within  reasonable  limits. 


ANALYSIS  OF  AN  IMPROVED  NIU 
ARCHITECTURE 


The  trade-off  studies  reported  above 
demonstrated  the  merit  of  offloading  the  TRIJ  from  all 
link  level  functions,  except  those  performed  by 
hardware  at  the  speed  of  the  network  (CRC  checking, 
flag  generation  and  incoming  packet  address  filtering). 
The  models,  however,  were  of  limited  practical  use. 
since  they  assumed  single  information  character  per 
packet  and  did  not  model  flow  control  techniques  at  the 
network  level,  in  order  to  alleviate  buffer  overflow  and 
speed  mismatching.  In  what  follows,  we  will  highlight 
an  improved  NIU  architecture  which  implements 
variable  packet  sizes  and  end-to-end  flow  control. 
Then  we  present  a  queueing  network  model  and  an 
analytic  technique  to  obtain  throughput/delay 
measures.  The  analysis  is  applied  to  study  N'U 
maximum  throughput  and  its  sensitivity  to  various 
design  parameters. 


Architecture  and  Simplifying  Assumptions 


We  consider  an  NIU  interfacing  data  terminals  to 
a  broadcast  CSMA  cable.  As  before,  the  NIU  consists 
of  a  number  of  parallel  TIU  CPU  boards  (up  to  four)  and 
a  TRU  with  no  CPU  or  memory  on  board.  The  DMA 
transfer  between  TIUs  and  TRU  is  performed  via 
arbitration  on  a  master  bus  at  speed  equal  to  the 
coaxial  cable  speed  (typically  10  Mbps).  Cycle  stealing 
is  assumed  although  burst  transfers  could  be  modeled  as 
well. 

Each  TIU  provides  a  number  of  serial  ports  for 
terminal  interfacing.  Typically,  the  number  of 
terminals  supported  pier  NIU  varies  from  eight  4.8  Kbps 
terminals  to  four  9.6  Kbps  and  one  56  Kbps  high  speed 
DTE.  As  in  Fig.  5,  a  task  queue  controls  the  CPU 
processing  from  the  I/O  interrupt  to  packet 
assembly/disassembly,  address  formation  and  control 
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field  setting.  The  completed  data  packet,  with  its  "TIU 
header",  Fig.  6,  will  then  reside  in  RAM  awaiting  for 
DMA  transfer.  The  TIU  processing  time  will  be  given 
approximately  by  the  linear  formula. 

TIU  Processing  =  p  ♦  q-  N 

Where  p  is  a  fixed  processing  time  per  packet,  N 
is  the  number  of  information  byte  per  packet  and  q  the 
interrupt  processing  time  per  bit.  Typically,  q  =  0.01 
msec/bit  and  p  =  10-70  msec  per  input  data  packet  or 
p  =  5  -  35  msec  for  ACKs  and  output  packets.  Here, 
input  refers  to  the  DTE-to-network  direction  and 
output  to  the  network-to-DTE. 
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Terminal  data  are  packetized  upon  occurrance  of 
either  of  the  following  events: 

(1)  carriage  return  of  other  control  character. 

(2)  buffer  space  full  (typically  2048  bits) 

(3)  no  new  character  arrived  from  a  DTE  within 
a  certain  window 

(typically  10  to  50  msec.) 

In  the  case  of  terminal-to-computer  interactive 
mode,  which  is  the  basis  of  our  model,  the  information 
field  of  host  response  packets  is  usually  ten  times 
longer  than  a  terminal  query.  In  our  analysis,  we 
assume  that  terminal  packets  are  exponentially 
distributed  with  mean  10-100  bits  (1-10  characters)  and 
response  packets  are  exponential  with  mean  100-1000 
information  bits.  These  numbers  are  consistent  with 
measurements  on  5.6  Kbps  terminals  performing 
interacting  computing. 

The  DMA  cycle  stealing  mechanism  is  assumed  to 
slow  down  the  CPU  activity  by  a  factor  of  30%  during 
actual  data  transfer.  We  ignore  DMA  controller  turn¬ 
around  and  initialization  tunes  by  assuming  that  each 
DMA  transfer  direct  'on  is  associated  with  a  separate 
DMA  channel.  Hence  reinitialization  is  done  during 
transmission  on  the  opposite  direction.  It  may  be 
desirable  to  have  direct  memory  access  between 
adjacent  TIU's  and  implement  memory  shared  parallel 
processing.  This  option  is  not  incorporated  in  our 
present  performance  analysis. 

Date  packets  (including  the  TIU  40  bits  overhead. 
Fig.  6),  are  processed  at  the  TRU,  where 
synchronization  bits,  flags  and  CRC  bits  are  generated 
at  the  coaxial  cable  speed  ( I  - 1 0  Mbps).  The  TRU 
ov<-  head  (48  bits)  is  illustrated  in  Fig.  6.  The  TRU  is 
responsible  for  sensing  the  carrier,  deferring 
transmission  upon  busy  condition  and  retry  alter  a 
random  retransmission  period.  In  our  analysis,  we 
assumed  that  the  random  interval  has  mean  i  msec.  No 
collision  detection  was  modeled  although  it  could  be 
simply  incorporated  in  the  model. 


396 


Collided  or  otherwise  destroyed  packets  would  De 
detected  via  the  acknowledgement  mechanism,  ol  the 
link  protocol.  In  Fig.  7  a  logical  link  is  established 
through  the  network  between  interacting  OTEs.  On 
that  link,  reliable  exchange  is  guaranteed  by  an 
acknowledgement  (ACK)  mechanism.  Each  query 
packet  (or  group  of  packets)  is  answered  by  a  special 
ACK  packet  (or  negative  ACK  in  case  of  error  or 
collision)  or  an  ACK  bit  piggybacked  on  information 
packets  (lowing  in  the  opposite  direction.  The  window 
is  the  maximum  number  of  outstanding  packets 
awaiting  ACK’s.  In  this  paper,  we  considered  only  the 
case  of  window  size  one.  In  other  words,  the  protocol  is 
a  stop-and-wait  scheme,  whereby  no  packet  is 
transmitted  until  the  previous  one  is  correctly 
acknowledged.  Negative  acknowledgements  would 
result  in  retransmissions  of  the  same  packet.  We 
assumed  that  the  delay  to  receive  a  negative  ACK, 
automatically  introduces  the  random  deferment  needed 
in  random  access  schemes.  Retransmitted  packets 
follow  the  same  route  from  the  TIU  RAM  to  the  TRU 
transfer  via  DMA.  The  assumption  of  a  single  packet 
window  limits  the  maximum  throughput  yf  the  system. 
It  is  however  a  simple  technique,  especially  suitable  for 
interactive  applications  where  the  network  induced 
delay  is  orders  of  magnitude  smaller  than  user  think 
time  and  query  processing  at  the  host  end.  An 
extension  to  larger  window  sizes  can  be  easily  derived 
using  the  same  modeling  methodology. 
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3.2  Queueing  Network  Model 

The  NIU  architecture  is  modeled  as  a  closed 
queueing  network  using  the  methodology  outlined  in 
[14]  .  The  finite  population  of  "|obs"  travelling  within 
the  network,  equals  the  number  of  terminals  connected 
to  the  NIU.  These  jobs  will  be  either  packets  under 
processing  at  the  TIU  CPU,  packets  waiting  to  be 
transmitted,  ACKs  travelling  bark  to  the  NIU,  host 
originated  responses  to  be  transmitted  to  the  terminals, 
or  "permits”  at  the  user  terminals,  whereby  a  response 
has  been  received  and  the  te-minal  is  either  in  the 
process  of  serializing  a  new  packet  or  simply  in  a 
"thinking"  state.  Each  user  has  a  corresponding  "job"  in 
one  of  the  above  states. 

In  Fig.  8,  the  queueing  network  is  illustrated  for 
two  TfU's  per  NIU,  each  interfacing  two  terminals.  The 
terminals  (small  circles  to  the  left),  have  a  pure  delay 
associated  with  them,  representing  the  lime  needed  to 
input  a  new  packet,  when  no  other  packet  from  this 
user  waits  at  the  TIU  buffer.  This  delay  will  be  zero  in 


case  of  heavy  users  who  do  not  wait  for  a  response  to 
generate  a  new  query.  In  some  interactive 
environments,  users  do  not  fill  the  interface  buffer 
until  they  see  the  echo  from  the  previous  packet.  In 
such  case,  the  "idle"  time  includes  serialization  of  input 
and  output  characters,  and  user  think  time.  Due  to  this 
excessive  delay,  piggybacking  ACK’s  is  impossible  and 
stand-alone  ACK’s  are  used.  In  our  analysis  we 
assumed  heavy  terminal  users  and  mostly  piggybacked 
ACK’s  in  order  to  study  maximum  thoughput  scenario. 
Obviously,  maximum  throughput  is  achieved  when  there 
is  always  at  least  one  packet  ready  to  be  transmitted 
upon  receipt  of  a  response  and  the  pure  delay  at  the 
terminals  is  zero. 

The  CPU  processes  a  mix  of  packets  consisting  of 
ACK’s,  input  and  output  packets.  It’s  service  time  is 
slowed  down  due  to  the  DMA  cycle  stealing. 

The  control  queue,  serves  all  "ready"  packets. 
Due  to  the  FCFS  nature  of  the  DMA  arbitration 
mechanism,  we  include  all  TIU  RAM’s  within  a  single 
buffer,  although  this  is  not  physically  the  case.  The 
same  mix  of  packets  are  served  as  in  the  TIU  CPU,  with 
rate  equal  to  the  DMA  transfer  (or  the  coaxial  cable 
speed).  However,  due  to  the  CSMA  protocol,  the  server 
is  modeled  with  an  ON/OFF  switch,  representing  the 
unavailability  of  service  when  the  carrier  is  sensed 
busy.  The  switch  will  increase  the  effective  service 
rate  by  introducing  retrial  intervals  weighted  by  the 
probability  of  busy  condition  (see  Appendix  2).  We 
assume  that  the  reserving  interval  is  random  with  a 
mean  of  5  msec. 


Upon  completion  of  service  at  the  DMA  queue, 
host  responses  and  ACK’s  are  sent  back  to  the  CPU.  As 
explained  above,  a  user  data  packet  will  generate  either 
a  negative  ACK  on  collision  or  a  positive  ACK 
(stand-alone  or  piggybacked  within  host  response 
packets).  ACK’s  will  be  generated  at  the  destination 
end  after  a  delay  equal  approximately  to  C  --  5- IS 
msec.  Host  response  packets  will  be  either 
immediately  transmitted  (if  they  already  wait  in  RAM) 
or  need  to  be  generated  after  some  random  "idle"  time. 
In  Fig.  8,  ACK  generation  and  host  response  times  are 
modeled  as  pure  (non-queueing)  delay  elements.  For 
maximum  throughput  studies,  we  assumed  that  all 
positive  ACK’s  are  piggybacked  on  host  packet  ready  to 
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be  transmitted.  Titos  in  heavy  trjffic  scenario,  r0  r  Ty 
r  0,  or  users  and  hosts  generate  p.e  kets  within  the  time 
delay  of  the  flow  control  window. 

An  enact  analysis  of  the  closed  queueing  network 
of  Fig.  8,  cannot  be  obtained  due  to  the 
non-exponential  mix  of  queues  I  and  2.  Reasonable 
results  may  be  obtained  via  the  FCFS  approximations  in 
mean  value  analysis  found  in  fl5]  .  The  network  is  a 
multichain  model,  where  each  Til  I  board  corresponds  to 
one  Cham  with  chain  population  equal  to  the  nuinoer  o( 
terminals  per  TUJ.  In  Appendix  I,  we  provide  further 
details  on  the  analytic  technique  as  implemented  for 
the  single  chain  (one  T1U/TRU)  simplified  version 
reported  in  3.3. 

3.3  Numerical  Examples 

The  modeling  and  analytic  technique  presented  in 
3.2  was  implemented  for  a  single  TIU  per  NIU 
configuration  as  in  Fig.  9.  Several  parallel  TIU's  per 
NIU  can  definitely  be  studied  as  well,  but  since  most  of 
the  existing  LAN  interfaces  do  not  support  such 
parallelism  for  cost  considerations,  we  preferred  to 
limit  this  study  to  presently  relevant  architectures. 
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A  highlight  of  the  algorithms  used  in  our  analysis 
is  given  in  the  two  Appendices.  Our  design  parameters 
were  set  initially  to  the  values  reported  in  3.2;  more 
specifically  we  assumed: 

Number  of  Terminals  per  NIU:  U;» 

Cable  Speed:  C  =  10  Mbps 
Input  Data  Packet:  Xin  =  100  bits 
Output  Data  Packet:  Xout  =  1000  bits 
TIU  Overhead:  40  bits 
TRU  Overhead:  44  bits 

Processing  Time  per  Input  Packet:  p  =  1 0  msec 
Processing  Time  per  ACK/Output  Packet  *  p/2 
Interrupt  Time  per  bit  =  0.01  msec 

Carrier  Resensing  Delay:  5  msec 
DMA  Cycle  Stealing  Fraction:  30% 

To  assess  the  CSMA  parameters,  we  also  need  the 
network-wide  cable  utilization  s  and  the  cable 
propagation  delay  a.  We  assumed  that  these  were  not 
affected  by  the  individual  NIU  traffic  variations 


(except  changes  on  the  packet  sire  which  are  assumed 
to  be  universal).  This  is  a  reasonable  assumption  (or  a 
broadcast  LAN  with  200  NIU's  connected  to  it,  each 
with  different  throughput.  The  s  and  a  parameters 
were  taken  as: 

s  =  22% 

a=  1.35  microsec. 

In  Fig.  10,  we  present  the  NIU  throughput  and 
delay  variations,  as  the  terminal  and  host  "idle"  times 
vary.  Note  that  by  NIU  throughput,  it  is  meant  the 
aggregate  amount  of  information  bits/sec  flowing  in 
both  directions.  By  NIU  delay,  we  mean  the  additional 
time  a  terminal  generated  data  packet  has  to  wait  for 
transmission  due  to  NIU  processing.  It  is  apparent  from 
the  figure,  that  maximum  throughput  values  are 
obtained  with  very  short  idle  times  (packets  always 
wait  in  memory  to  be  transmitted  as  soon  as  the 
window  opens).  The  effect  of  the  TIU  processing  tune 
p,  is  dominant  in  heavy  traffic  conditions.  As  idle 
times  increase,  the  throughput  is  limited  by  external 
factors  rather  than  the  NIU  bandwidth.  Similarly 
delays  depend  critically  upon  p. 
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Network  parameters  were  shown  to  have  a 
minimal  effect  on  NIU  throughput.  Specifically,  with 
cable  speeds  I  Mbps  and  10  Mbps,  maximum  throughput 
values  were  46,46  Kbps  and  47.06  Kbps  respectively,  for 
the  same  network  utilization.  In  Fig.  II,  we  plotted 
the  maximum  NIU  throughput  versus  network 
utilization.  For  utilizations  less  than  $0%,  NIU 
performance  was  basically  insensitive  to  s.  The 
instability  at  utilizations  higher  than  60%,  is  due  to  the 
CSMA  saturation. 
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Finally,  in  Fig.  12,  we  studied  the  effect  of 
packet  sizes  on  NIU  throughput,  NIU  delay  and 
"round-trip”  delay  (the  time  elapsed  from  closing  the 
window  at  the  terminal  NIU,  till  a  response  on  ACK 
from  the  host  reopens  it).  It  is  important  to  note  that 
although  the  throughput  increases  with  the  packet  size 
(since  more  data  bits  experience  the  fixed  processing 
time  per  packet),  the  curve  approaches  some  limit, 
whereby  the  bit  interrupt  processing  and  cable 
transmission  delay  offset  the  advantage  gained  from 
larger  packet  sizes.  Note  that  the  packet  size  is  mainly 
limited  by  other  considerations,  such  as  buffer  size  and 
packetization  latency. 

4.  SUMMARY  OF  CONCLUSIONS 

A  conclusion  consistent  with  experience  of  LAN 
users  and  vendors  is  that  the  network  parameters  do  not 
greatly  affect  throughput/delay  performance,  if  the 
cable  is  utilized  below  saturation  (e.g..  60%).  The 
limiting  (actor  in  most  cases  reside  at  the  interface 
unit,  where  bandwidth  can  be  much  lower  than  the 
network  capacity  to  carry  data. 

In  analyzing  the  internal  NIU  behavior,  we  first 
identified  that  protocol  partition  within  the  NIU  is  an 
important  design  factor.  Specifically,  execution  of  the 
link  protocol  at  the  tranceiver  unit  by  a  centralized 
CPU  for  all  terminal  interface  boards,  appear  highly 
inefficient.  On  the  other  hand,  an  architecture 
whereby  the  terminal  interface  units  control  their 
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associated  links,  with  the  tranceiver  performing  basic 
transmission  functions  in  hardware,  achieves  a  high 
degree  of  parallel  processing  and  alleviates  the  control 
bottleneck. 

The  second  part  of  the  paper,  dealt  with  modeling 
and  analyzing  an  improved  NIU  architecture.  The 
window  flow  control  of  the  end-to-end  link  was 
modeled  via  closed  queueing  network  methods  and 
approximate  analysis  were  performed.  It  was  found 
that  the  NIU  throughput  was  very  sensitive  to  the 
protocol  processing  delays,  where  as  cable  speed  and 
utilization  do  not  have  any  obvious  effect  in  normal 
operating  conditions.  An  important  design  parameter 
(apart  from  the  CPU  speed),  is  the  packet  size  with  the 
trade-off  being  between  throughput  and  buller  size. 

The  above  results  were  obtained  for  a  simple  NIU 
architecture  (one  terminal  interface  unit  per  MU)  a. id  a 
stop-and-wait  (window  1)  protocol.  Immediate 
extensions,  include  more  than  one  parallel  boards  oir 
NIU  and  window  sizes  greater  than  1.  Further  areas  of 
research,  include  modeling  the  sources  (terminals)  mot.; 
precisely,  assess  quantitatively  buffer  size 
requirements  and  improve  the  approximate  analysis  on 
the  FCFS  non-exponential  queues. 
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Description  of  the  Algorithms 


In  order  to  compute  the  first  and  second  moments 
of  the  service  times  at  queue  I  and  2,  we  define  the 
probabilistic  breakdown  of  the  traffic  mix  and  weight 
means  and  second  moments  of  various  components 
appropriately.  As  an  example,  for  queue  I,  we  have 
ACKs  short  (input)  and  long  (output)  data  packets. 

At  queue  one,  the  service  tune  r\  must  also 
include  the  DMA  cycle  stealing  effect.  Thus,  if  Tj  is 
the  service  time  without  DMA  overhead  and  Utru  is 
the  TRU  utilization  (fraction  of  time  data  are 
transmitted  or  received). 

T,  -  ,  T' 

1  l  -  0.3Utru 

Here,  the  cycle  stealing  is  assumed  to  comsune 
30%  of  the  CPU  power.  The  utilization  of  the  TRU  is  a 
function  of  the  N1U  throughput.  This,  however,  is  not 
known  a  priority  and  an  iterative  procedure  is  used, 
whereby  an  initial  run  is  performed  assuming  UfRy=  0 
and  subsequently  new  iterations  correct  t |  until 
convergence  is  reached.  In  our  numerical  examples,  the 
procedure  converged  within  1-2  iterations. 

At  queue  two  the  service  time  r2  must  include 
the  effect  of  CSMA  carrier  busy  delays.  Let  Ps  be  the 
probability  the  carrier  is  sensed  busy  as  computed  in 
Appendix  2  and  Ds  the  mean  interval  at  which  the  TRU 
retries  to  sense  the  cable.  If  this  interval  is  uniformly 
distributed  and  T2  is  the  service  time  with  Ps  x  0,  we 
have  that 

T2  x  r2'  ♦  Ns  Ds 

E(T22)  ,  E(T^2)  +  *  Dj2  Nj2  (l  ♦  Ns) 


where  Ns  r 


the  average 
number  of 
retrials 


The  throughput/delay  evaluation  for  the  finite 
population  network  of  Fig.  9  follows  the  single  chain 
mean  value  analysis  for  closed  queueing  networks  as 
reported  by  Reiser  in  [l  5]  .  Due  to  the  non-exponential 
servers  no  product  form  solution  exists  and  the  FCFS 
approximate  technique  of  [l  5j  is  used.  The  input 
consists  of  To,  T|,  E(Tj  2),  t2,  E(T22),  T3,  Tu  and  the 
average  number  of  retransmissions  of  collided  packets, 
Nc. 

The  algorithm  proceeds  as  follows: 

1.  set  w  i  1,  A(0)  .  Aj  =  A2  -  nl  -  n2  -  0 

2.  tf  s  T 1  ♦  r  1  (n| -XT|)  ♦  0.5  A|  Eff  |2) 

t2  -T  2  *  r  2  <n2  *A.T2)  ♦  0.5  A2  E(T22) 

3.  t=  T0  ♦  Nc  Tj  *  <2  ♦  Nc)  1 1  ♦  2(1  ♦  Nc)  t2 

A(w)  x  w/t 

Aj  x  (2  ♦  Nc)  A(w) 

A2  x  2  (1  ♦  Nc)  A(w) 

nl  *  *1 A  | 

n2  x  t2  A2 

9.  IF  w  x  K  (the  number  of  terminals/NIU),  STOP; 
ELSE,  w  •-  w*l 

END;  C°  T°  2 


CSMA  Parameters 

We  used  the  pure  CSMA  throughput  formulation 
as  reported  in  [l6].  It  assumes  that  Poisson  sources 
(NIUs)  connected  to  the  cable  transmit  packets  of 
length  d  sec  at  a  rate  S  packets/sec.  The  packet 
length  is  computed  as  the  average  among  input,  output 
and  ACK  packets.  The  cable  propagation  delay  was 
assumed  ax  1.J6  microsec.  If  the  total  scheduled  rate 
C  (including  unsuccessful  and  collided  attempts)  is 
Poisson,  the  following  formula  applies: 

_L_  =  _ L— _ x  Pr  {success} 

C  G(2a  ♦  d)*e'aG 

the  probability  of  collision,  Pc  is  given  by: 

I  _  ^-aG _ 

Pc  =  G(2a  ♦  d)*e-*G 

and  the  probability  ot  busy  carrier  Ps  by 

psi1.pc._s_ 
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